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Foreword 


It is a distinct pleasure to have the opportunity to introduce Professor 
Malliavin’s book to the English-speaking mathematical world. 

In recent years there has been a noticeable retreat from the level of ab- 
straction at which graduate-level courses in analysis were previously taught 
in the United States and elsewhere. In contrast to the practices used in the 
1950s and 1960s, when great emphasis was placed on the most general 
context for integration and operator theory, we have recently witnessed 
an increased emphasis on detailed discussion of integration over Euclidean 
space and related problems in probability theory, harmonic analysis, and 
partial differential equations. 

Professor Malliavin is uniquely qualified to introduce the student to anal- 
ysis with the proper mix of abstract theories and concrete problems. His 
mathematical career includes many notable contributions to harmonic anal- 
ysis, complex analysis, and related problems in probability theory and par- 
tial differential equations. Rather than developed as a thing-in-itself, the 
abstract approach serves as a context into which special models can be 
couched. For example, the general theory of integration is developed at an 
abstract level, and only then specialized to discuss the Lebesgue measure 
and integral on the real line. Another important area is the entire theory 
of probability, where we prefer to have the abstract model in mind, with 
no other specialization than total unit mass. Generally, we learn to work 
at an abstract level so that we can specialize when appropriate. 

A cursory examination of the contents reveals that this book covers most 
of the topics that are familiar in the first graduate course on analysis. It also 
treats topics that are not available elsewhere in textbook form. A notable 
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example is Chapter V, which deals with Malliavin’s stochastic calculus of 
variations developed in the context of Gaussian measure spaces. Originally 
inspired by the desire to obtain a probabilistic proof of H6rmander’s theo- 
rem on the smoothness of the solutions of second-order hypoelliptic differ- 
ential equations, the subject has found a life of its own. This is partly due 
to Malliavin and his followers’ development of a suitable notion of “differen- 
tiable function” on a Gaussian measure space. The novice should be warned 
that this notion of differentiability is not easily related to the more con- 
ventional notion of differentiability in courses on manifolds. Here we have 
a family of Sobolev spaces of “differentiable functions” over the measure 
space, where the definition is global, in terms of the Sobolev norms. The 
finite-dimensional Sobolev spaces are introduced through translation op- 
erators, and immediately generalizes to the infinite-dimensional case. The 
main theorem of the subject states that if a differentiable vector-valued 
function has enough “variation”, then it induces a smooth measure on Eu- 
clidean space. 

Such relations illustrate the interplay between the “upstairs” and the 
“downstairs” of analysis. We find the natural proof of a theorem in real 
analysis (smoothness of a measure) by going up to the infinite-dimensional 
Gaussian measure space where the measure is naturally defined. This in- 
terplay of ideas can also be found in more traditional forms of finite- 
dimensional real analysis, where we can better understand and prove for- 
mulas and theorems on special functions on the real line by going up to the 
higher-dimensional geometric problems from which they came by “projec- 
tion”; Bessel and Legendre functions provide some elementary examples of 
such phenomena. 

The mathematical public owes an enormous debt of gratitude to Leslie 
Kay, whose superlative efforts in editing and translating this text have been 
accomplished with great speed and accuracy. 


Mark Pinsky 

Department of Mathematics 
Northwestern University 
Evanston, IL 60208, USA 


Preface 


We plan to survey various extensions of Lebesgue theory in contemporary 
analysis: the abstract integral, Radon measures, Fourier analysis, Hilbert 
spectral analysis, Sobolev spaces, pseudo-differential operators, probabil- 
ity, martingales, the theory of differentiation, and stochastic calculus of 
variations. 

In order to give complete proofs within the limits of this book, we have 
chosen an axiomatic method of exposition; the interest of the concepts in- 
troduced will become clear only after the reader has encountered examples 
later in the text. For instance, the first chapter deals with the abstract inte- 
gral, but the reader does not see a nontrivial example of the abstract theory 
until the Lebesgue integral is introduced in Chapter II. This axiomatic ap- 
proach is now familiar in topology; it should not cause difficulties in the 
theory of integration. 

In addition, we have tried as much as possible to base each theory on the 
results of the theories presented earlier. This structure permits an econ- 
omy of means, furnishes interesting examples of applications of general 
theorems, and above all illustrates the unity of the subject. For example, 
the Radon-Nikodym theorem, which could have appeared at the end of 
Chapter I, is treated at the end of Chapter IV as an example of the theory 
of martingales; we then obtain the stronger result of convergence almost 
everywhere. Similarly, conditional probabilities are treated using (i) the 
theory of Radon measures and (ii) a general isomorphism theorem show- 
ing that there exists only one model of a nonatomic separable measure 
space, namely R equipped with Lebesgue measure. Furthermore, the spec- 
tral theory of unitary operators on an abstract Hilbert space is derived from 
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Bochner’s theorem characterizing Fourier series of measures. The treatment 
in Chapter V of Sobolev spaces over a probability space parallels that in 
Chapter III of Sobolev spaces over R”. 

In the detailed table of contents, the reader can see how the book is 
organized. It is easy to read only selected parts of the book, depending on 
the results one hopes to reach; at the beginning of the book, as a reader’s 
guide, there is a diagram showing the interdependence of the different sec- 
tions. There is also an index of terms at the end of the work. Certain parts 
of the text, which can be skipped on a first reading, are printed in smaller 
type. 

Readers interested in probability theory can focus essentially on Chap- 
ters I, IV, and V: those interested in Fourier analysis, essentially on Chap- 
ters I and III. Chapter III can be read in different ways, depending on 
whether one is interested in partial differential equations or in spectral 
analysis. 

The book includes a variety of exercises by Gérard Letac. Detailed solu- 
tions can be found in Exercises and Solutions Manual for Integration and 
Probability by Gérard Letac, Springer-Verlag, 1995. The upcoming book 
Stochastic Analysis by Paul Malliavin, Grundlehren der Mathematischen 
Wissenschaften, volume 313, Springer-Verlag, 1995, is meant for second- 
year graduate students who are planning to continue their studies in prob- 
ability theory. 


March 1995 P.M. 
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Prologue 


We recall briefly the definition and properties of the usual integral of con- 
tinuous functions on R. 

The concepts involved are elementary and well known. However, since 
this integral will be used to construct the Lebesgue integral, we sketch a 
few facts for convenience. 

Given the segment [0,1] C R, a partition of [0,1] is a finite subset a of 
(0, 1] containing 0 and 1. The partition 7’ is said to be finer than 7 if 7’ D 7. 
Let 0 = t, < te <... < tp, < tp = 1 (r = card(m)) be an enumeration 
of the points of 7. With every function f continuous on [0,1], we associate 
the sum 


r—-1 
sx(f) = D_ (thai — te) f(ta)- 
k=1 


This is a positive linear functional: 


Sn(fi + fo) = Sr(fi) + n(f2) and s,(f)>0 if f>0. 


The number 6(7) = sup(tx41 — tx) is called the diameter of the partition 
am. We have the following statement. 
Given a continuous function f, for every € > 0 there exists n such that 


Sa yaael pl <e 


for any partitions x and x’ satisfying 6(1) <n and 6(x’) < 7. 
Indeed, since f is continuous on the compact set [0,1], it is uniformly 
continuous. Hence we can find 7 such that | f(x) — f(2’)| < § if|2—2’| < 7. 


XxX Prologue 


Let m” = Ua’. Then, writing 7/ = 7’ U[tx41—tg], where t),...,¢, denote 
the points of the subdivision of 7, 


"=Um and sqo(f)= Yost) 


Moreover, 
Isa (f) — (teoi — te) f(te)| < 5 (tins — tk), 
whence : P 
Isn(f) — Sav (f)| < 3 So (tet -t) = 3 
and 


Isa(f) — se(DI< 545 =6 


Choosing a sequence 7m, of partitions such that 6(7,) — 0, we find that 


Sx,(f) is a Cauchy sequence whose limit is independent of the choice 7p. 
Set 


y f(x) dx =lims,, (f). 
0 


Then the integral is a positive linear functional. In particular, 


~ fle)da] < | F(a) dz < max|f(2)|. 


The change of variable x = a + t(b— a) reduces the integral over [a,b] to 
the preceding case: 


b 1 
| fla)ae == f fla+t(b—a)) dt 


Differentiation. Let f be continuous. Set 
om 
F(x) = f(t) dt. 
0 


Then F is differentiable and F’(x) = f(x). Evaluating integrals of contin- 
uous functions is reduced to finding primitives. 

Improper integrals. Integrals will be evaluated either on all of R or on [0, 1]. 
The functions we integrate on R will be continuous; those we integrate on 
[0,1] will be continuous on (0,1). The elementary procedure consists of 
passing to the limit: 


ys 
/ = lim We [- lim . 
n— +00 n— +00 


Prologue Xx1 


We have the concepts of convergence and of absolute convergence. The 
Lebesgue theory will be developed in the second setting: every Lebesgue 
integrable function will have Lebesgue-integrable absolute value. For this 
reason, we consider here only absolutely convergent improper integrals. The 
following results can easily be proved by calculating primitives. 

If f is continuous and positive on R and if f(x) ~ |z|~*% as 
then the integral of f on R exists if and only if a > 1. 

If f is continuous and positive on (0,1) and if f(x) ~ |z|~? as x = 0, 
then i, f exists if and only if 8 < 1. 

These results generalize to R” by passing to polar coordinates. We find 
in the first case that a > n, and in the second that 6 < n. (In the second 
case, we integrate a function continuous on R” and zero outside a compact 
set.) 


x| — +00, 


I 


Measurable Spaces 
and Integrable Functions 


Introduction 


In this chapter, we follow an axiomatic method of exposition. The interest 
of the concepts introduced will not appear until Chapter II. We introduce 
the notion of a measure space, a space endowed with a family of measurable 
subsets satisfying the axioms of a o-algebra. This approach parallels that 
of the theory of topological spaces, where a topological space is a space 
endowed with a family of open subsets. As we will see in Chapter IV, a 
peculiarity of the concept of a o-algebra is that it is adapted to the propo- 
sitional calculus (Boolean algebra). Since negation is an operation of this 
calculus, this leads to the axiom that the complement of a measurable set is 
measurable. The fact that o-algebras are closed under taking complements 
is an essential difference between the family of open sets of a topological 
space and the family of measurable sets of a measure space. In order to 
be able to take limits of sequences, we impose another axiom: A countable 
union of measurable sets is measurable. 

Having defined the concept of a measurable space, we introduce a class of 
morphisms adapted to it: the measurable mappings. We introduce a natural 
measurable structure on a topological space: the Borel structure. Continu- 
ous mappings are thus special cases of measurable mappings. A remarkable 
result is that the limit of a pointwise convergent sequence of measurable 
mappings is itself measurable. Thus all the functions appearing in prac- 
tice in mathematical analysis are measurable functions. A measure space 
is a measurable space which is given a “mass distribution”. The concept 
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of negligible sets, or sets of measure zero, is introduced; two measurable 
mappings are considered equivalent if they differ on a negligible set. 

We introduce the concept of convergence in measure, which gives a com- 
plete metric space structure to the space M of equivalence classes of mea- 
surable mappings from a measure space to a complete metric space. When 
we consider functions on a measure space, i.e. mappings with values in R, 
we introduce simple functions, those that assume finitely many values. The 
integral, defined trivially on certain simple functions, extends to an appro- 
priate completion, which defines the space L of integrable functions. The 
theorems on passage to the limit under the integral sign are then an easy 
consequence of the fact that L! is a complete space. The chapter concludes 
with Fubini’s theorem and the duality between L? spaces. 


1 o-algebras 


Let X be an abstract set. A a-algebra on X is a family A of subsets of X 
satisfying the following three axioms: 


1.0.1 The set X belongs to A. 
1.0.2 If A € A, its complement A‘ € A. 


1.0.3 Every countable union of sets in A belongs to A; t.e., if An € A 
Wn EN, then (U,-NAn) € A. 


A Boolean algebra on X is a family B of subsets of X satisfying 1.0.1, 
1.0.2, and 


1.0.4 Every finite union of sets in the algebra B is in B. 
Every o-algebra is thus a Boolean algebra. By using Axiom 1.0.2 and 
passing to the complement, we find that 1.0.3 implies 


1.0.5 If A, € A, then (O,eN An) € A. 

An analogous statement is obtained for Boolean algebras by restricting 
to finite intersections. In what follows, we will not pursue the parallels 
between Boolean algebras and o-algebras, but the reader should note that 
most theorems involving passage to the limit are false for Boolean algebras. 


1.1 Sub-o-algebras. Intersection of o-algebras 


Given two o-algebras A and A’ on the abstract set X, we say that A’ is 
a sub-o-algebra of A if A € A’ implies A € A. More formally, let P(X) 
denote the set of subsets of X. We may view a o-algebra A on X as a 
subset of P(X). The “order relation” between o-algebras corresponds to 
the relation of inclusion between the subsets of P(X). 


1.1.1 More generally, if G is an arbitrary family of subsets of X and A is a 
o-algebra on X, we say that A DG if A € G implies A € A. 
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1.1.2 Intersection of c-algebras 


Definition. Let {A,,a € I}, be a family of c-algebras on X. We denote 
by A’ = NaerAa the family of subsets of X defined by A € A’ if and only 
if A € A, for alla € J. A’ is a o-algebra called the intersection of the Ag. 

We verify only 1.0.3, the other axioms being even more obvious. Let 
An € A’, set 

Z= UneNAn: 

and fix ag. Since A, € Ag, and Ag, satisfies 1.0.3, it follows that Z € Aa,. 
As this is true for all ap, we conclude that Z € A’. 


1.2 o-algebra generated by a family of sets 


1.2.1 Theorem. Let G be a family of subsets of X. Then there exists on 
X a smallest o-algebra containing G. 


PROOF. Consider the o-algebras B on X such that 
(P) BOG. 


Let J denote the family of o-algebras B satisfying (P), and set Ag = 
Npe,B. Then Ap is a o-algebra by 1.1.2, and it is the smallest o-algebra 
of the family I. 0 


1.2.2 Definition. Apo is called the o-algebra generated by G. We say that 
G is a system of generators of Ag. 


1.2.3 Fundamental example: Borel algebras 


Let X be a topological space and let Ox be the family of open subsets of 
X. The g-algebra generated by Ox is called a Borel algebra, and written 
Bx. 

An element of Bx is called a Borel set. Open sets are Borel sets, as are 
closed sets (as complements of open sets). The family of closed sets could 
equally well be taken as a system of generators of By. 


1.3 Limit of a monotone sequence of sets 


1.3.1 Definition. Let A, be an increasing sequence of subsets of X. We 
call the union of the A,, the limit of the sequence Ay, and we set 


Aco =limt An =(JAn, where An C Angi. 
n 
Similarly, given a decreasing sequence B,, of subsets of X, we call the 


intersection of the B,, its limit: 


Bo =lim | Bn, = (7) Bars where B, D Bn41. 


n 
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A sequence of subsets of X is monotone if it is either increasing or de- 
creasing. 


1.3.2 A monotone class is a family M of subsets of X such that if {Ay} is 
a monotone sequence for which A, € M for each n, then its limit is in M. 


1.3.3 Proposition. A o-algebra is a monotone class. 


Proor. Let {A,,} be an increasing sequence of sets in the o-algebra A. 
Then by 1.0.3 
lim ? An =|_J An € A. 


Similarly, 1.0.5 proves the statement for decreasing sequences. 


1.3.4 An arbitrary intersection of monotone classes is a monotone class. 
Thus, given a family Z of subsets of X, there exists a smallest monotone 
class Mo containing Z. Mg is called the monotone class generated by Z. 


1.4 Theorem. Let By be a Boolean algebra of subsets of X, M the mono- 
tone class generated by By, and B the a-algebra generated by By. Then 
B=M. 

PROOF. By 1.3.3, B is a monotone class. Since B contains By, it contains the 


smallest monotone class containing B,; thus B > M. 
Conversely, for all A € P(X), let 


1.4.1 &(A) = {Be P(X): AUB,A—B,B- Ae M}. 


Then the assertions B € ®(A) and A € ®(B) are equivalent. 
Fixing A, we show that ®(A) is a monotone class. Indeed, if B, is an increasing 
sequence of elements of ®(A), then 


B, — A is an increasing sequence of elements of M, 


AUB, is an increasing sequence of elements of M, 
A—B, is a decreasing sequence of elements of M, 


and their limits are elements of M. Furthermore, 
lim tT (AU B,) = AU lim T Bn, 


whence lim 7 Bn € ®(A). 
Let Ao € By; then Bo € ®(Ao) for all Bo € By. Hence ®(Ao) is a monotone 
class containing B,. Thus ®(Ao) > M, or B € ®(Ao) for any Ao € By, BEM. 
Conversely, Ao € ®(B); i.e., ®(B) D By for any fixed B € M. 
Since 6(B) is a monotone class, it follows that ®(B) > M. 
We have proved that 


1.4.2 B— B’, B'’- B, BUB’ © M whenever B, B’ € M. 
Taking B’ = X shows that B° € M if B € M, and thus 
1.4.3 M is a Boolean algebra. 


The following lemma, 1.4.4, implies that M is a o-algebra. Since M 5 By, M 
contains the a-algebra generated by By; hence BC M. 0 


1 o-algebras 5 


1.4.4 Lemma. Let Z be a Boolean algebra which is closed under increasing 
limits. (That is, if Z, is an increasing sequence of elements of Z, then 
lim f Z, € Z.) Then Z is a o-algebra. 


PROOF. Let An € Z and set Zn = Ui<p<nAp; then 
Un An = UnZn = lim T Zn € Z, 


and Axiom 1.0.3 is satisfied. 


1.5 Product o-algebras 


Definition. Let X,, X2 be abstract sets equipped with o-algebras A, Ao, 
and let the Cartesian product X, x X2 be denoted by X. 


1.5.1 A rectangle R is a subset of X of the form 
R=A,x Ao with A;€ A; (a = 1, 2). 


The set of all rectangles is denoted by R. 


1.5.2 The c-algebra generated by R is called the product o-algebra and 
denoted by A; @ Ag. 


1.5.3 The union of a finite number of disjoint rectangles is called an ele- 
mentary set. The family of elementary sets is denoted by €. 


1.5.4 Proposition. The elementary sets form a Boolean algebra. 


PROOF. Note first that the union of a finite number of disjoint elementary sets 
is an elementary set. 
Let R = A; x Ag, R! = Aj x A} be two rectangles; then 


(R)° = (At x X2) N(X1 x Ad). 


Hence 
R’—-R=R,UR.UR3;, 


where Ry = (ALN A}) x (A2 Ad), Re = (Ain Aj) x (ARN Ad), and R3 = 
(AS NA‘) x (ASA AS). Thus 


(2) R'—R_ is an elementary set. 


Let E = RURz4 be an elementary set that is the union of two disjoint rectangles. 
(We restrict to two in order to simplify notation.) 


R!'—E = (R’—R)—Rs = (Ri UR2UR3)— Ra = (Ri — Ra) U(R2— Ra) U(R3— Ra). 
Applying (i), we obtain 

(22) R'—E is anelementary set if ECE, R' ER. 
If E’ € € then E£’ = UR; (R; disjoint) and E’ — E = U(R; — E), whence 

(iit) (E’-E)e€€ forany E,E’ €€. 
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Taking FE’ = X, x X2, we obtain 1.0.2. Furthermore, 
(A; x A2) ial (Aj x Ad) = (Ai nN Aj) x (Ag M Ad). 

Hence the intersection of two rectangles is a rectangle and, more generally, 
(iv) EnNE'¢€€ if E,E’ EE. 

Indeed, if E = UR; and E’ = UR;, then EN E’ = Uj.q(Rj 1 R;). (Note that 
the sets Rj Rj are disjoint.) 

Finally, 
(v) BUE =(B-E))U(E’-E\U(EnNE’. 

The three quantities in parentheses on the right-hand side are elementary sets 
by (ii) and (iv); since they are disjoint, EU E’ € € and 1.0.4 is satisfied. 
1.5.5 Corollary. The o-algebra A, © Ag is the monotone class generated 
by the elementary sets. 


PROOF. 1.5.4 and 1.4. 


2 Measurable Spaces 


2.1 Inverse image of a a-algebra 
Let X, X’ be abstract sets and let f be a mapping from X to X’. Let Gg’ 


be a family of subsets of X’. We write 
FUG) ={AEP(X):A=f '(A) with A’eg’}. 
2.1.1 Proposition. Let A’ be a c-algebra on X'; then f~'(A’) is a o- 
algebra on X. It is called the inverse image of A’ under f and denoted by 
A= pA. 
Proor. The inverse image of X’ is X. In addition, 
Usf (AL) = fot(UsA4) (Axiom 1.0.3 is satisfied); 
Rene. 28 CAS) (Axiom 1.0.2 is satisfied). 
2.1.2 Taking the inverse image preserves inclusion between a-algebras: 
f-(Al) > fo AD whenever A; SA): 
2.1.3 EXAMPLE. Let Y be a subset of the set X’, let i be the canonical 
injection of Y into X’, and let A’ be a o-algebra on X’. Then 
(A) = {Ber ):t (8) eA} 
= {BeEP(Y):JA’ eA’ such that A’ NY = B}. 
In this special case, i~!(.A’) is called the trace a-algebra of the a-algebra 
A’ on the subset Y. 


Since Y is a subset of X’, every subset of Y can be identified with a 
subset of X’. It is easy to verify that 


(i) iA re es re 
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2.1.4 Transitivity of inverse images 


Suppose that X, X’, and X” are three abstract sets, f and h are mappings 
such that X 4x74 X", and G” is a family of subsets of X”. Then 


FRG") = (ho fy 1G". 


2.2 Closure under inverse images of the generated o-algebra 


2.2.1 Theorem. Suppose that X and X’ are abstract sets, f is a mapping 
from X to X', G’ is a family of subsets of X', and A’ is the a-algebra 
generated by G’. Then f—1(A’) is the a-algebra generated by f~1(G’). 


PROOF. Let B denote the o-algebra generated by f~1(G’). 
Bc f7'(A) since f-1(G’) c fo’). 
To prove that BD f~1(A’), we let 


B'={B’c X’': f-1(B’) € B} 
and prove that B’ is a o-algebra. 


(i) f7'(X’) = X € B; hence X’ € B’. 
(ii) Let B’ € B’; then f-!(X’ — B’) = X — f-!(B’) € B since B is 
a o-algebra . 
(iii) Let By, € B’; then f~!(Un Bi) = Unf7!(Bi) € B. 


B' DG’; hence B’ contains A’, the o-algebra generated by G’. Let A’ € A’. 
Then A’ € B’ since B’ D A’. Hence f~!(A’) € B. 0 


2.3 Measurable spaces and measurable mappings 


2.3.1 Definition. The pair (X,.A) consisting of a set X together with a 
a-algebra A of subsets of X is called a measurable space. 


2.3.2 Definition. Given two measurable spaces (X,.A) and (X’, A’), a map- 
ping f of X to X’ is called measurable if f~!(A’) C A. 

M((X, A); (X', A’) will denote the set of measurable mappings of (X, A) 
into (X’, A’). 


2.3.3 Proposition. The composition of measurable mappings is measur- 
able. 


PROOF. Let fi € M((X,A);(X',A’)), fo € M((X', A’); (X",.A”)). Then 
by 2.1.4 f = foo fy satisfies (foo fi)-1(A") = fr (fo (A) C fp (A) 


A, and hence fo 0 f; is measurable. 0 
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2.3.4 Proposition (Measurability criterion). Let (X,A) and (X', A’) 
be measurable spaces, let A’ be the a-algebra generated by G, and let G’ Cc 
A’. Then the following are equivalent: 


(i) f € M((X, A) (X", A’) 
(i) FUG)CA. 
Proor. Let A; be the o-algebra generated by f~!(G’). Then (ii) is equiv- 


alent to A, C A. Furthermore, A; = f~1(A’) by Theorem 2.2.1; hence (ii) 
is equivalent to (i). 0 


2.3.5 Measurable mappings into a product 


Let (X,.A), (¥1, Bi), and (¥2, Bz) be measurable spaces. Let Y; x Yo be 
given the product o-algebra B,; ® Bo, defined in 1.5.2, and let 7; (i = 1,2) 
be the natural projection of Y; x Y2 onto Yj. 


Lemma. 7 € M((¥1 x ¥5,B, & Bo); (Y1, B1)). 


PROOF. We must consider a, (Bi), where B, € B,. But my '(Bi) = B,xY> 
is a rectangle, and hence an element of 6; ® By. O 


Proposition (Measurability criterion for a mapping into a prod- 
uct). Let f be a mapping of X into Y; x Y2. Then f is measurable if and 
only if its components f; = 7; 0 f (t=1,2) are measurable. 


PROOF. Suppose that f is measurable. Then, by the preceding lemma, 
7,0 f is a composition of measurable mappings and hence measurable. 
Conversely, suppose that f; and f2 are measurable and let R = By, x Bo 
be a rectangle. Then f7!(R) = f7'(Bi) 9 fy '(B2). Each f7'(B;) is in 
A, hence so is their intersection, and the measurability criterion 2.3.4 then 
shows that f is measurable. 


2.4 Borel algebras. Measurability and continuity. 
Operations on measurable functions 

2.4.1 Separability and measurability 

Separability of topological spaces 

Let Y be a Hausdorff space. 


(i) Y satisfies the first separability axiom if there exists a subset D of Y which is 
countable and dense in Y (closure of D = Y). 


(ii) Y satisfies the second separability axiom if there exists a countable family of 
open subsets H; such that every open set in Y may be written as a union of the 
H; that it contains. The family H; is called a basis of open sets for Y. 


(iii) EXAMPLE. Let Y = R and let Q be the set of rational numbers. Setting 
Ho,.92 = (91.42), we obtain a countable family of intervals. Then every interval 
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(x1, £2) can be written as a union of the H; that it contains. The same holds for 
any open set. 


(iv) Proposition. Let Y be a metric space satisfying the first separability axiom. 
Then it satisfies the second. 


PROOF. Let {y;} be a dense sequence in Y. We denote by d the distance on Y 
and set Him = {y € Y : d(y, yi) <m7'}, where m € N. For each open set O in 
Y, let O’ be the union of the Him contained in O. Then O’ is an open subset of 
O. Let z € O. Then there exists mo such that the ball with center z and radius 
mg ' is contained in O. Let j be such that d(y;,z) < (2mo)7}. Then z € Hj,2mo, 
and hence O c O’. 


(v) The space R” satisfies the first separability axiom and hence the second. 


(vi) The second separability axiom implies the first. It suffices to choose a 
point y in each H; to obtain a dense sequence. 


Because of (vi) and (iv), we refer to a metric space which has a dense 
sequence as a separable metric space. 


(vii) Let Y, Y’ be two separable metric spaces. Then their product Y” is 
separable: Set yi, = (yj, yi); then the {y/’,,} form a countable dense subset 
of Y”. 


(viii) Proposition (Measurability criterion). Suppose that (X,.A) is a 
measurable space, Y is a topological space satisfying the second separability 
aziom, and H; is a basis of open sets of Y. Then a mapping f: X — Y is 
measurable if and only if 


fO(H) eA, ieN. 


PROOF. This follows immediately from the measurability criterion 2.3.4. It 
must be shown that, for every open set O, f~'(O) € A. Let O = U,Hi,; 
then f~1(O) =U.f—!(Hi,) € A. 0 


REMARK. (viii) provides an explicit criterion for the measurability of a 
function. 


2.4.2 Product of Borel algebras 


Proposition. Consider two separable metric spaces X, and Xo and their 
product Y = X1 x X2. Let Y be equipped with the product topology. Denote 
by By, Bo, and By the associated Borel algebras. Then By = B, ® Bo. 


PROOF. Y is separable by 2.4.1. The family of open sets of the product 
topology is generated by the countable unions of open rectangles: Ry = 
O, x Og, where O; € Ox,. Hence Ro € B, ® Bo; that is, Oy C By ® Bo. It 
follows that By C B, @ Bo. 

Let 7, be the projection of Y onto X;. Then m € M((Y, By); (X1, B1)) 
since 7; '(Ox,) C By. 


10 I. Measurable Spaces and Integrable Functions 


It follows from 2.3.5 and the fact that 7; 07 (j = 1,2) is measurable 
that the identity mapping 7 : (Y, By) + (Y,B, ® Bz) is measurable. Thus 
7 (By © Bz) C By, or B} ®@ By C By. O 


2.4.3 Measurability and continuity 


Let X and X’ be topological spaces. Equipping them with their Borel 
algebras Bx and Bx-, we obtain measure spaces (X, Bx) and (X’,Bx:). 


Proposition. Every continuous mapping f from X to X' is a measurable 
mapping from (X,Bx) to (X’, Bx:). 


PROOF. We use the measurability criterion 2.3.4. It must be shown that 
f7'(Ox-) C Bx. But since f is continuous, the inverse image of an open set 
is open, whence f~!(Ox-) C Ox. Since Ox C Bx, the conclusion follows. 


2.4.4 Algebraic operations on measurable functions 


Consider the field of real numbers R with its Borel algebra BR. Given 
a measurable space (X,A), we denote by £°(X,.A) the set of measurable 
mappings from (X,.A) to (R, Br). Elements of £°(X,.A) are called measur- 
able functions. When X is a topological space with its Borel algebra Bx, 
elements of £L°(X, Bx) are often called Borel functions. 


Proposition. The absolute value of a measurable function f is measurable. 
The sum and product of two measurable functions are measurable. The 
multiplicative inverse of a measurable function which is everywhere nonzero 
is measurable. 


PROOF. Let u be the mapping from R to R defined by the absolute value: 
u(¢) = |¢|. Then u is continuous, hence measurable, and 2.3.3 implies that 
|f| = uo f is measurable. 

Let ® be the continuous mapping of R? — R defined by ®(¢),¢2) = 
C1 + C2. Similarly, let Y(¢1, C2) = 12. 

Let f; and fz be measurable functions on X, and let F(x) = (fi(x), fo(a)). 
Then F : X > R? and, by 2.3.5, 


Fe M((X, A); (R’, Br @ Br)). 


By 2.4.2, Bk ® BR = Bz; hence F € M((X,A),(R’, Baz)). Since © is 
continuous, ® € M((R?, Brz); (R, Br)). Thus, by 2.3.3, 


®o F € M((X, A); (R, Br)) = £L°(X, A). 


But (®o F)(x) = fi(x) + fo(z). 

Similarly, Uo F € £°((X,A)) and (Wo F)(x) = fi (x) fo(z). 

We denote R — {0} by R’. Let 7 be the continuous mapping of R’ > R’ 
defined by n(¢) = 2 and let f € £°(X, A), f(x) #0 for all « € X. If O is 
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an open set in R, then O' = OMR’ is an open set in R’. Set g(x) = Fay 
Then g~'(O) = g-1(O’) = f~'(n71(O’)). Since n~!(O’) is an open set 
in R’ and R’ is open in R, 7~!(O’) is open in R. Since f is measurable, 


FM (O) EAT 


2.5 Pointwise convergence of measurable mappings 


In this section, (X,.A) denotes a measurable space, Y a metric space, and 
By the Borel algebra of Y. We say that a sequence of mappings f, : XH» Y 
converges pointwise to fo if lim f, (x) = fo(x) for every x € X. 


2.5.1 Theorem. Let f,, be a sequence of measurable mappings which con- 
verge pointwise to fo. Then fo is measurable. 


REMARK. It is well known that the pointwise limit of a sequence of contin- 
uous functions is not necessarily continuous. This theorem shows the great 
stability of the property of measurability. 


PRooF. Let fy € M((X, A); (Y,B)). Let d denote the distance in Y and 
let O be an open set in Y. For every k > 0, let 


On= {2 EO: da0%)> 7h. 


Then O, is an increasing sequence of open sets in O and O = U keNO«- 
Moreover, denoting by O; the closure of O;, we have Ox C Opa. 
Since d(fo(x), fm(x)) — 0, it follows that 


fo(x) € On => fa(z) € Ox if ¢ is large enough, say g > mo. 


Set Hing = Mq>mofq (Ox). Since fy is measurable, each f>'(Ox) € A, 
whence Hf, € A. Let Ge = Um Hf,,; then G* € A. 

We have thus shown that fo(z) € O, > x € G**! or, taking the union 
over k, fo(z) € O 2 €U,-NG’, which may be written as 


(2) fo'(O) CW, where W= UneNG’ € A. 


We now prove the reverse inclusion. Let 2; € G". Then there exists m, 
such that rz € H7,,, or 21 € fr @®) if q > m,. Thus lim fy(a1) € O, C 
O,+1 C O and therefore 


(#2) fo (0) DW. 


From (i) and (ii) it follows that fj '(O) = W, or W € A, whence fo is 
measurable. 0 
For emphasis, we restate (i) and (ii) in the following form. 
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2.5.2 Fundamental lemma. Let {f,,} be a sequence of mappings from X 
to the metric space Y that converges pointwise to fo. Then for every open 
setO inY, 


fy (OVS | (}e ee) , where On= {re 0:d(e,0%) > zh. 


rm |q>m 


2.6 Supremum of a sequence of measurable functions 


For convenience of notation, we introduce in this section the set R of real 
numbers completed by adjoining the two elements +00 and —oo. 

Addition and multiplication in R are defined in the elementary way, 
except for the “indefinite forms” +-co — —oo and 0- oo. 


R is given the obvious order relation, with +-oo the largest and —oo the 
smallest element. A distance is defined on R by setting 


d(a,x') = |Arctanz — Arctan2’|. 


Every subset of R has a supremum, or least upper bound. The empty 
set is assigned the supremum —oo. 


2.6.1 Proposition. Let {fn} be a sequence in M((X, A); (R, Bx)) and let 
yp =sup fr. Then py € M((X, A); (R, Bg))- 


PROOF. Since {+00} is a closed subset of R, f71({+o0}) € A. Set G = 
Unf, '({+00}). Then G € A and y(x) = +00 if € G. 

Let X’ = G°, equip X’ with the trace A’ of the o-algebra A, and denote 
by f/, the restriction of f, to X. Then 


fr € M((X', A’); (R, Br)) = £°(X", A’). 


Moreover, by 2.4.4, sup(f{, f3) € £L°(X", A’). 

More generally, let the sequence {g,} be defined by recursion: g; = f; 
and gy = sup(f;,9r—-1) if k > 1. 

An induction argument shows that gz4, € £°(X',A’). Moreover, gx < 
Gr4i. Thus {g,} is an increasing sequence, hence convergent in R. Set 
yi(x’) = lim g,(2’), 2! € X’. Then, by 2.5.1, y: € M((X’, A’); (R, Br)). 
Furthermore, v(x) = y1(z) if x € X’ and y(x) = +00 if a ¢ X’. 

Let K be a closed subset of R. Then 


oUK)=pi(K) if +oo¢ K 

gp '(K)=y,'(K)UG if +oeK. 
Since yy '(K) = X’N A with A € A and X’ € A, it follows that yg, (K)e€ 
A. oO 


2.6.2 Corollary. Let f, € M((X,A);(R,Bg)). Then (limsup fr) € 
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PROOF. Let Y, = sup,>,, fp. Then yp is measurable. The sequence {pn (x) } 
is decreasing, hence convergent in R, and 2.5.1 gives the result. 0 


3 Measures and Measure Spaces 


Definition. Let R’ = {¢ € R: ¢ > 0} U {+c0}. Given a measurable 
space (X, A), a measure on (X,A) isa mapping uw: A> R* satisfying the 
following two axioms: 


Countable additivity (c-additivity) axiom 


3.0.1. Let Ay € A, k € I, be a finite or countable family of measurable sets 
that are pairwise disjoint; that is, A, NA; = 0 ifk Al. Then 


(i) ry (U as] = Do u(As)- 


kel kel 


In particular, 

(ii) p(A, U Ag) = pw(A1)+ u(A2) if A1N 42 =0 (finite additivity). 
o-finiteness axiom 

There exist A, € A such that 

3.0.2 X=UnAn, and p(An) < +00 Vn. 


The sequence {A,,} is called an exhaustion sequence for X. If u(X) < 
+00, X is said to have finite measure (or finite total mass) and yp itself is 
called a finite measure. It is possible to develop part of the theory without 
using 3.0.2, the o-finiteness axiom. However, the axiom will always be sat- 
isfied for the applications we have in mind, and we take this point of view 
for ease of exposition. 


Definition. A measurable space (X, A) equipped with a measure pz defined 
on A is called a measure space and is denoted by (X, A, 11). 


EXAMPLE. Let {z;} be a countable sequence of points of X and let {a;} 
be a sequence of positive real numbers. For A = P(X) and A € A, set 


B(A) = > Oj. 


ai,EA 


Then (X,.A, 1) is a measure space. If a; = 1, i € N, this measure pu is 
called the counting measure associated with the sequence {z;}; (A) equals 
the number of points of the sequence {x;} which lie in A. 

This example is trivial and does not reveal the complexity of the theory. 
In fact, we will not obtain nontrivial examples of measure spaces until 
Chapter IT. 
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3.1 Convexity inequality 
Proposition. Let (X, A, 1) be a measure space. Then 


3.1.1 ps is increasing; that is, if Ay and Ap € A and A, C Ag, then p(A1) < 
(Ag). 


3.1.2 ps is convex; that is, if By,...,Bn € A (not necessarily disjoint), then 


fu (U a < M(B). 


PROOF. Let A; C Ag and let B = A{M Ag; then B € A and Ag = A, UB. 
The finite additivity axiom gives 


(Ag) = u(A1) + H(B). 


Since u(B) > 0, we conclude that (Az) > u(A1). 
Similarly, let the sequence B,,...,B,,... be defined recursively: 


B, = B, and By = B, a (Upeg By)" ,qg>i. 


Then B, € A, UL, Bj =U", Bj, and by finite additivity 


B; C B; implies u(B;) < y(B;), and the desired inequality follows. 


3.2 Measure of limits of monotone sequences 


Theorem. Let A,, Ao, ..., An, -.- be an increasing sequence of measurable 
sets. Let 


+00 
lim t A; = LJ Aj. 


i=1 


Then 

3.2.1 p(lim T A;) = lim p(Aj). 

Theorem. Let B,, Bo, ..., Bn, ... be a decreasing sequence of measurable 
sets. Let 


+00 
lim | B; = ( ) Bi. 
i=1 
3.2.2 Suppose that there exists kq such that p(Bg,) < +00. Then 


3.2.3 p(lim | By) = lim p( Bx). 
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REMARK. The properties described by these two theorems are sometimes 
called continuity on increasing sequences and continuity on decreasing se- 
quences. 


PRroor. Consider the measure space (X,.A, 1). For A, € A, set Avs Ay 
and An+1 = AS ia) Ani ifn>1. 
Then A,+41 € A, the A, are disjoint, and A; = Ug<;Aq. Hence, by finite 


additivity, 
= > u(Aq) 
qSj 


Moreover, Ufo, Aj = Wee ay and, by o-additivity, 


+00 a 
(Aj). 
1 


ul 4; ] = 
j=l 
Hence, for increasing sequences, 3.2.1 reduces to the simple observation 
that the sum of a series of nonnegative terms is the limit of its partial 
sums; that is, Bet © u(A;) = lim, ee 4 u(A,). This limit always exists, 
whether it is finite or infinite. 

In order to prove 3.2.3, we set A, = By, 1 BE, k > ko. Then Aj, is 
an increasing sequence. The relation By, = By U A}, By and Aj, disjoint, 
implies (Bry) = (Be) +u(A;,). Hence p(Aj,) < u(Bp,) and pu(lim 7 Aj) = 
lim (AL) = 6 < (By). We have 


(lim | By) U (lim T Ay) = Bro, 


whence 
(lim | Be) + w(lim t Ay) = 4( Beg), 


or finally 


w(lim | By) = 1(Beo) — lim u(Ay) = lim[u( Bro) — #(Ay)] = tim u( Bp). 


3.2.4 Application — Exhaustion principle 


We now roughly sketch a principle that will often be used. Let (P) be a 
property that is true for all finite measures. Let (X,A,y) be a measure 
space with an exhaustion sequence A,. Let X, = An, equipped with the 
trace o-algebra A, of the o-algebra A, and let ju, be the restriction of p 
to A,. Then each jun, is finite and therefore satisfies (P). 

To conclude that yu satisfies (P), it suffices to show that “the limits of 
values of 7, appearing in (P) are finite”. 
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3.2.5 REMARK. Let o be a mapping from A to R* satisfying the finite 
additivity axiom 3.0.1(ii) and property 3.2.1 of continuity with respect to 
increasing sequences. Then o satisfies 3.0.1(i), since 


o(U% Ap) = o(lim(U}A,)) = lim o(U}' Ap) = lim } > o(Ap) = 5) 0p), 


where the third equality follows from finite additivity. 


3.3 Countable convexity inequality 


Proposition. Let {A,,} be a sequence of (not necessarily disjoint) elements 
of A. Then 


+00 +00 
Ll (U 4s] < S> (An). 
n=1 n=1 


Proor. Set By = U%_, An. Then B, is an increasing sequence, and by 
3.2.1 we have 


+00 
i” (U 4s] = lim (By). 


Furthermore, by the finite convexity property 3.1.2, 


+00 
u(By) < S- u(An) < So w(An)-0 


n=1 n=1 


4 Negligible Sets and Classes 
of Measurable Mappings 


The concept of measurable mappings is extremely easy to work with. In 
particular, the theorem that a pointwise limit of measurable mappings is 
measurable makes the operations of analysis very convenient. The drawback 
of this convenience is that the space of measurable functions is enormous, 
and therefore hardly usable. We will work on a quotient space. 


4.1 Negligible sets 


Definition. Let (X,A,,) be a measure space. A subset Z of X is called 
negligible if there exists A € A such that (A) =0 and AD Z. 


4.1.1 Proposition. A countable union of negligible sets is negligible. 


4 Negligible Sets and Classes of Measurable Mappings 17 


PROOF. This follows from countable convexity: 


U (U a) = 2 H(A) 


Since every term on the right-hand side is zero, the sum of the series is 
zero. O 


Definition. A property (P) is said to be true p-almost everywhere (denoted 
pi-a.e.) on the measure space (X, A, 1) if 


{x : (P) does not hold at x} is contained in a negligible set. 


4.1.2 Let (P,) be a proposition implying the proposition (P2). Then (P,) 
true p-a.e. => (P) true p-a.e. 


4.1.3 Theorem. Let (P;),..., (Pn), ... be a sequence of properties defined 
on (X,A,). Suppose that each of the properties (P;) is true p-a.e. Then 
their conjunction is true p1-a.e. 


PROOF. Let A; be a negligible set that contains {zx : (P;) does not hold at x}. 
Then A. = U;A; is negligible. If 2 ¢ A, then all the (P;) hold at 2. 0 


4.2 Complete measure spaces 


4.2.1 Definition. Given the measure space (X,.A, 1), the o-algebra A is 
called y-complete if every subset of a negligible set is measurable. 

The measure space (X, A, jz) is called complete when A is p-complete. 

The space is complete if and only if every subset of a negligible set is 
negligible. 

On a complete measure space, a property P is true p-a.e. if the set 
{x : (P) does not hold at x} is negligible. 


4.2.2 Completion theorem. Let (X,.A,,:) be a measure space. Then 
there exist a o-algebra A’ D A and an extension p' of to A’ such that 
(X,A’,p’) is complete and, for all A' € A’, there evist Ay, Ap € A’ with 
A, C A’ C Ag, (Az — Ai) = 0. This o-algebra A’ is unique and will be 
called the completion of A. 


PROOF. Define 
A’ = {Ze P(X): 5Ai, Ao € A such that Ai C ZC Aa and (Az — A1) = O}. 


Clearly A’ 5 A. We show that A’ is a o-algebra. If Z € A’, then AS C Z° C AS 
and Aj — AS = Az — A1, whence Z° € A’. Hence Axiom 1.0.2 is satisfied. 

Let Z” € A’. Then there exist A? and A} such that A? C Z" C A. Set 
Z* = UZ", AT? = UAT, and AS = UA. Then 


AP CZ ede aid AP sare | (az aay aN 
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The right-hand side, as a countable union of negligible sets, is negligible, 
whence Z € A’ and A’ is a o-algebra. 

To extend p to A’, we first note that u(A2) = u(A1) + w(A2 — Ai) = p(A1). 
For Z € A’, let p’(Z) be defined by p'(Z) = p(A1). 

We now show that this is independent of the choice of A; C Z C Ag. Let 
Alc ZC Ao: At, A. € A, u(Ag _ Ai) = 0. Then Ar >Z >; A, whence 
(Az) > u(A1) = (Az). Similarly u(A2) > p(A2), whence u(A2) = p(A2). 
Moreover, if Z” is a sequence of disjoint sets, then so is AT; hence u(UAT) = 
5 w( At), and we have shown that y’ is countably additive. 

Finally, y’ is complete: letting Z € A’ with p’(Z) = 0, there exists Ag € A 
satisfying Z C Ao and (Az) = 0. Let Z; C Z. Then @ C Z C Az, where 0, Ag 
€ Aand (Az — 0) = 0. Therefore Z, € A’. 


4.3 The space M,,((X,A);(X', A’)) 
(i) On M((X, A); (X', A’)), let the equivalence relation be defined by 


fof if fl@)=f(e) pre. 
The equivalence class of f is denoted by f. 


(ii) The transitivity of this relation follows from 4.1.3. 


4.3.1 Definition. The quotient of M by this equivalence relation is denoted 


An element f € M, is a mapping f : X — X’, defined “up to a set of 
jl-measure zero”. 


4.3.2 Let E be a negligible set and let py: X —-E— X’. 
Suppose that y is a measurable mapping when X — € is given the trace 
a-algebra induced by A. Define f : X — X’ by setting 


f(z) = v(x) if rEXx-E 
f{(2) = 26 if weé, 


where x is an arbitrarily chosen element in X’. 
Then f € M((X,A);(X',A’)), and y determines the equivalence class 
of f in M,,((X, A); (X", A’). 


4.3.3 REMARK. When X’ = R and A = Br, the operations defined on 
measurable functions (sum, product, sup) are compatible with the equiva- 
lence relation. The quotient of £°(X,A) = M((X,.A);(R, Br)) is denoted 
by LO (X A). 

Thus the operations sum, product, and sup are defined on LAX A), 
Moreover, any element of oe. , A) with a representative that is nonzero 
almost everywhere has a well-defined inverse. LO (X , A) is called the space 
of equivalence classes of measurable functions. 
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5 Convergence in M,,((X, A); (Y, By)) 


Throughout this section (X,A,) denotes a measure space, Y a 
separable metric space, and By the Borel algebra of Y. 


5.1 Convergence almost everywhere 


5.1.1 Definition. Let f,, € M,,((X, A); (Y, By)). {f,,} is said to converge 
almost everywhere if, when representatives f,, of f,, are chosen, {f,,(z)} is 
convergent j-a.e. 

We first show that this definition is independent of the choice of rep- 
resentatives. Let g, = fn ji-a.e. Denote by (P,,) and (Ff) the following 
propositions: 


(F) lim fn (x) exists. 


Let (G) be the conjunction of (F) and the (P,). Then, by 4.1.3, (G) 
is true -a.e. Since (G) implies the convergence of the gn, 4.1.2 gives the 
result. 


5.1.2 Proposition. Let f,, € M,,((X, A); (Y, By)). Suppose that {f,,} con- 
verges almost everywhere. Then 


lim fn(z) 


defines an element Gy € M,,((X, A); (Y, By)). 


PRooF. Choose an arbitrary yo € Y, let (F) be defined as in 5.1.1, and let 
K be a negligible set such that K > {x : (F) is not satisfied at x}. Let 


Qn(t) = fn(x) rek 
Gn(2) = Yo rek’. 


Then, by 4.3.2, gn € M((X, A), (Y, By)) and Gj, = fn- 

Moreover, if c € K then {g,(x)} converges by 5.1.2; if « ¢ K, then 
9n(x) = yo and hence the sequence converges. 

Thus {g,(x)} converges for all x € X, and Theorem 2.5.1 shows that 
go = lim gy satisfies 

go © M((X, A); (Y, By)). 
Hence 
lim fy, = go € M,((X, A); (Y, By)).0 


5.1.3 Lemma. Given f,g € M((X,A);(Y,By)), let apg be defined by 
ap.g(@) = A( f(x), 9(x)). Then qy,g is a measurable function and Vn € Rt 
{X: Gf.g(x) > n} is measurable. 
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Proor. Let Y? = Y x Y and let w be the mapping from Y? into Rt 
defined by the distance: (yi, y2) = d(yi, y2). 

Let H : X — Y? be defined by x + (f(a), g(a)); then, by 2.3.5, H € 
M((X,.A); (¥?, By2)). 

Since w is continuous, yo H € M((X,.A);(R, Br)). 

Moreover, since (7, +00) is an open set in R, qj g((, +o0))E€ AO 


5.1.4 Egoroff’s theorem. Let (X,A,) be a measure space and suppose 
in addition that u(X) < +00. 

Then fn € My((X,A),(Y,By)) converges p-a.e. to fy if and only if, 
choosing representatives f,, fo of the classes fie; Fag 


Ve>05K.€A_ such that p(Kf) <e 


and f,(x) converges uniformly on K,. to fo. 


PRoor. Necessity is clear. Set ¢€ = m~!, m a positive integer. Then fy, 
y g' 


converges to fo on UK,,-1 = G. Since u(G°) < p(KS_,) for every m, 
u(G°) = 0. 
We now prove sufficiency. Set 


Ang = {2+ d(fute) fola)) > =}. 


Then A,,_ € A by 5.1.3. 

Let Bmjq = Un>mAn,q. Since Bmq is a decreasing sequence for fixed q, 
the hypothesis of convergence p-a.e. together with the limit theorem 3.2 
imply that u(B,,,q) — 0 for every fixed g as m — +00. 

Fix an increasing sequence m, such that (Bm, 4-1) < € 27". Set K. = 
Ure, Biny.k-1- Then 


1 
w(Ke)<e and d(fm,(z), fo(x)) < Z if 7 >m,z, ce K,.0 


5.2 Convergence in measure 


Convergence almost everywhere allowed us to introduce a notion of conver- 
gence of sequences in M,,. We now define a metric on the space M,,, and 
thus a new notion of convergence. 

Let (X,A,,1) be a measure space and let (Y,By) be a metric space 
equipped with its Borel measure. We denote by d the distance on Y. 


5.2.1 Construction of an extended distance on M,,((X, A), (Y, By)) 


Let f,g € M and let qs, be as defined in 5.1.3. With the pair of functions 
(f,g) we associate the subset of (R*)? defined by 


K(f,9) = {(e.n) € (RT)? : wap g(n, +00) < €}. 
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Set 
e(f,g) =inf(e+7) where (e,n) € K(f,g). 


If K(f,g) is empty, we set e(f,g) = +00. 


5.2.2 An equivalent extended distance 


Set 
e(f,g) = 2inf(A) where (A,A) € K(f,9). 
Then we have 


e(f,g) S e(f,g) < 2e(f,9). 


The first inequality is proved by writing é(f,g) = inf(A+ ), with (A, A) € 
K(f,g). Note, moreover, that if (€,7) € K(f,g), then (e+a,7+) € K(f,g) 
for any a,3 > 0. Ife > 7, we take a = 0 and @ = € — 7 to obtain 
e(f,g) < 2 < 2e(f,g). The case « < 7 is treated in the same way. 


5.2.3 Lemma. Let f,g,h € M((X, A); (Y, By)). Then 


e(f.g) = e(g.f), 
e(f,g9) 0 is equivalent to f(x) = g(x) p-a.e., and 


elf,h) < e(f,g)+e(g,h). 


PROOF. The first statement is clear, and we prove the second. If e(f,g) = 0, 
then there exists a pair 


(En; 7) € K(f,9), En 0, ™In > 0. 


We may assume that , is a decreasing sequence. Then q~'((7n, +00)) is an 
increasing sequence and, by the limit theorem 3.2.1, 


u(lim T 95.4 ((1m, +00))) = lim p(q7,§ (mm, +00))) < limen = 0, 
whence 
w({x: d( f(z), 9(x)) > 0})=0, ie. f(x) = g(x) pae. 
Conversely, if f(x) = g(x) pr-a.e., then 
u(q7g((7,+00))) =0 Vn > 0. 


It remains to show that the triangle inequality holds. By the triangle inequality 
on Y, 


ap.n(@) S a4.g(%) + Qo,n(x)- 
Let (€1,m) € K(f,g) and (€2,72) € K(g,h). Then qy,n(x) > m1 +72 implies that 
df.g(Z) > m OF Qg,n(x) > N2. Hence 


gj..((m +2, +00)) € a74((m, +00)) U gy (m2, +00) 


and, by the convexity inequality, 


u(q5.n((m +12, +00))) < er + ee. 
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We have thus shown that (e1,7) € K(f,g) and (€2,72) € K(g,h) imply that 
(€1 + €2, +2) € K(f, h). 
Set 
G = K(f,g)+K(g,h) 
{(e.n) €(R*)?: e=a te, n=m +m, 
with (e1,m) € K(f,g), (€2,m2) € K(g,h)}. 
Then Gc K(f,h), and we obtain 


e(f,h) = inf (e+) < inf (€+ 7) = inf(er +e2 +m +72) 
(enek (e.neEG 


with 
(e1,m) € K(f,g) and (€2,m2) € K(g,h). 
Thus 


e(f,h) < inf(er + m) + inf(e2 + m2) = e(f,g) + e(g,h). 
5.2.4 Corollary. If f = f' andg =! pu-ae., then e(f,g) = e(f', 9’). 
ProoF. Since e(f,g) < e(f, f’) + e(f',9’) + e(g’,g) and the hypotheses 
imply that the first and third terms on the right-hand side are zero, it 


follows that e(f,g) < e(f’,g’). 
The opposite inequality is proved in the same way. O 


REMARK. e(f,g) depends only on the equivalence classes f and @. 
Abusing notation, we set e(f,9) = e(f,g), where f and g are chosen in 
the classes of f and g. 


5.2.5 Proposition. Suppose that (X,A,j) is a measure space and Y is 
a metric space. Let M,,((X, A); (Y,By)) be the space of equivalence classes 
of measurable mappings from X to Y and let e be as defined in 5.2.2. Set 


Then d,, is a distance on M,,. 


PROOF. Lemma 5.2.3 shows that e satisfies the axioms for a distance, except 
that e may assume the value +oo. We use a construction common in topology; 
let ; 
k(t)=——, teR*, k(+o0)=1. 
=. (+90) 


It is elementary to verify that the function t +> k(t) satisfies 
k(ti + tz) < k(t) + k(t2), ti, tg > 0. 


It follows that d, satisfies the triangle inequality and thus defines a distance 
on M,. 


REMARK (i). If w(X) < C, then it is always true that (C,0) € K(f,g) and 
hence that e(f,g) < C. In this case it is unnecessary to use d,,; e may be 
taken as a distance. 
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REMARK (ii). A sequence f, € M,, is a Cauchy sequence with respect to 
the distance d,, if and only if e(fm, fn) —- 0 whenever m and n — +00. 


5.2.6 Definition. A sequence f, is said to converge to fo in measure if 


e(fns fo) =? 0. 


Proposition. The sequence f, converges to fo in measure if and only if, 
for every fixed n > 0, 


u({z: d(fn(x), fo(x)) >t) +90 as n— too. 


PRoor. (<) Let no be such that 
u({x: d(fr(x), fo(x)) > n}) <n if n&no. 
Then (7,7) C K(fns fo), whence 
e(fn, fo) <2n if n2no. 


(=) Let 7 < 7 be given. Using 5.2.2, we can find n; such that €(fn, fo) < 


2m, ifn > ny; 1.e., (m,m) € K(fn, fo). Hence 


w({x: dfn, fo) > n}) <m. 


Since {x : d(fn, fo) > m} Cc {x : d(fn, fo) > n}, it follows a fortiori that 


w{e:d(fr, fo) > n})<m ifn > n,.0 


5.2.7 Theorem (Comparison of convergence in measure and con- 
vergence almost everywhere). Suppose that (X,A,) is a complete 
measure space, Y is a metric space, fo € M,,((X, A), (Y, By)), and {fr} is 
a sequence in M,,((X, A), (Y, By)). 


(i) If du(fn, fo) > 0, then there exists a subsequence {fn} of {fn} 
such that fn, — fo p-a.e. 

(it) Suppose in addition that us is a finite measure. Then the p-a.e. 
convergence of fn to fo implies that du(fn, fo) — 0. 


The proof depends on the following important lemma: 


5.2.8 Lemma (Borel-Cantelli). Let {A,} be a collection of elements of 
A such that 


S > (An) < +00. 


Then p-almost every x lies in at most a finite number of An. 
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vexity inequality; hence limm_—.o “(Bm) = 0. But since B, is a decreasing 
sequence, it follows from the continuity theorem (3.2.3) that u(O,Bn) = 
lim u(B,) = 0. Note finally that rx ¢ 9,B, = x is in only finitely many 
A,.O 


PROOF. Set Bm = Un>mAn. Then (Bm) < ak u(A,) by the con- 


PROOF OF THE THEOREM (PART (i)). Let ax, be the general term of a 
convergent series (for example, a; = 2~*). Fix an increasing sequence {nx} 
such that €(fm, fo) < 2ax if m > nz. Set 


Ay ={x: d(fn,(2), fo(v)) > ax}; then p(Ap) < ag. 


The Borel-Cantelli lemma implies that, u-almost everywhere, x belongs 
to only finitely many A;. Thus 


for p-almost every x, there exists an integer s(x) such that d( fn, (x), fo(x)) 
<a, ifk > s(x). 


Hence f,, converges ji-a.e. to fo. O 


PaRT (ii). Fix € > 0. Set 
G, = {2 : sup d(fq(x), fo(z)) > cb. 
q2n 


Then {G,,} is a decreasing sequence and, by 5.1.3 and 2.6.1, G, € A. Since 
G, C X implies that u(G,,) < +00, we can use the limit theorem 3.2.3 to 
conclude that 1((),, Gn) = lim u(G). 

The hypothesis of convergence p-a.e. implies that the left-hand side is 
zero. Let no be such that u(G,,) < €; then we have €(fn, fo) < 2€ ifn > no. 
Oo 


5.2.9 Theorem. Suppose that (X,A,) is a measure space and Y is a 
complete metric space. Then M,,((X, A); (Y, By )), equipped with the metric 
dy, 1s a complete metric space. 


PROOF. Our approach parallels that of the proof of 5.2.7(ii). Let {f,} bea 
Cauchy sequence in M,,; using a result from topology, we need only show 
that the sequence {f,,} has a subsequence that is convergent with respect 
to the distance d,,. Let a; be the general term of a convergent numerical 
sequence. Fix an increasing subsequence {n,} such that €(fn,, fm) < 2an 
for all m > ng. Set 


Ay = {x : A fre (2), Frys (€)) > ax}; 
then p(Ax) < ag, or 


A fri (®)s fry. (@)) <a, if x ¢ Ag. 
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Let € > 0 be given, fix ko such that }°,5,,@% < €, and set Dg. = 
Uk>ko Ak- Then 


() — w(Dro) <€ and YP dfn. (2): frer(w)) <€ if @ ¢ Dro. 
k>ko 
Hence {yz} = {fn,(z)} is a Cauchy sequence if x ¢ Dy,. 
As this is true for every ko, it follows that {f,,(z)} converges if x ¢ 
AbD; but u(N~De) = 0, ie. { fn, } converges pra.e. to fo € My. 
By inequality (i) and the triangle inequality, 


(it) dfn, (2), fry (£))<e€ if kk’ >ko andif xz ¢€ Dry. 
Fixing nz and letting k’ go to infinity, we obtain 
(fn, (2), fo(z)) <e if k>ko anda ¢ D,,, 


whence 
€(fna» fo) < 2€ if k>ko, 


or 
al Fra FO) < 2e if k> kyp.O 
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In this section, we exhibit a vector subspace of M,,((X,A);(R,Br)) = 
L°(X,A) which will be provided with a Banach space structure. The dis- 
tance defined by this norm will be an upper bound for the distance d,,, and 
will thus define a finer topology than that associated with d,,. 


6.1 Simple measurable functions 


Let (X,A) be a measurable space. A simple function is a measurable map- 
ping from X to R such that cardinal (f(X)) < -+-oo. We denote by €°(X, A) 
the set of simple functions. 

Let (X,A,) be a measure space. We denote by EQ(X , A) the subset 
of L(x , A) consisting of those equivalence classes of measurable functions 
which contain a simple function. 

If f,g € €°(X, A), then 


card((f + g)(X)) < card(f(X))card((9(X)) 


and 

card((fg)(X)) < card(f(X))card(g(X)), 
so that €°(X, A) is a vector space equipped with a product. The same holds 
for E)(X, A). Moreover, if f € £°(X, A) so is |f|; hence the operation sup 
is defined on €° and E?. 
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6.2 Finite o-algebras 


Let Y be an abstract set and let B be a o-algebra on Y. B is called finite 
if it has only a finite number of elements. Note that the finite Boolean 
algebras coincide with the finite o-algebras: the countable union property 
reduces to the finite union property in this case. Sets B € B such that 


B' cB, B'€B implies either B’=B or B’=90 


are called atoms. 
Atoms are the minimal elements with respect to the inclusion relation in 
a g-algebra. If B and B are distinct atoms, then BN B = 0. 


6.2.1 Proposition. Let B be a finite o-algebra. Then every nonempty set 
in B is the union of the atoms it contains. 


Proor. Let A € B. Either A is an atom or JA; C A, Ai # A, Ai € B. 
Repeat the argument, starting from Aj: either A; is an atom, or JA C A}, 
Az # A,, Az € B. This produces a sequence of subsets of Y, each strictly 
contained in the preceding one. Since B is finite, the process must terminate 
after finitely many steps, yielding an atom. We have thus shown that 


every nonempty set A € B contains at least one atom of B. 


Let H;,..., Hq be the atoms of B contained in A and let A = U,;H,;. Then 


ACA. ional: ACOA € B. If ACN A were nonempty, ACN A would 
contain an atom; but all the atoms contained in A are contained in A, 
whence A = A. 


6.2.2 Corollary. Let B be a finite a-algebra of subsets of Y. Then there 
exist a finite set Sp and a bijection between B and P(Sp), the set of all 
subsets of Sp, such that the bijection respects the Boolean algebra structure 
(the operations of union and intersection). 


PROOF. We take for Sp the set of atoms of B. The bijection between B and 
Sg is obtained by associating with each set B € B the atoms it contains. 
O 


6.2.3 Partitions 


Definition. A partition of X is a finite family of pairwise disjoint subsets 
of X, say Ky, ..., Ky, whose union is X. The o-algebra B generated by 
the K;, 1 < i <n, consists of sets B of the form B = Us Ky,. 


The atoms of B are precisely the K,. Conversely, given a finite o-algebra B 
on X, its atoms form a partition of X. 
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6.2.4 Finite o-algebras and simple mappings 


Let (X,.A) be a measurable space and let Y be a metric space. A function 
fe M((X, A); (Y, By)) is called simple if card(f(X)) is finite. 


Proposition. A mapping f is simple if and only if f~'(By) is a finite 
o-algebra . 


PROOF. (=>) Let x}, ... x be an enumeration of the image f(X). Then 
f-*({zi,}) are the atoms of f~'(By). 

(<=) Let U be an atom of f~!(By). Suppose that f assumes two distinct 
values on U, say y; and yg. Let O, and Oz be disjoint open sets in Y, 
yi € O; (4 = 1,2). 

Set UN f-1(O;) = U; (i = 1,2). Then U; € f-'(By), U, 40, U, CU, 
and U, # U, contradicting the hypothesis that U is an atom. 0 


6.3 Simple functions and indicator functions 


Given a subset A of X, the indicator function of A, written 1,4, is the 
function equal to 1 on A and zero on A®: 1a(x) = 1lifx € Aand 1y(zr) =0 
otherwise. 

The next proposition is easily verified. 


6.3.1 Proposition. 14 1g = laqg andl, + 1¢ = 1,4uc + Lanc. More- 
over, A is measurable if and only if 14 € E°(X, A). 


6.3.2 Proposition. Suppose that f assumes only finitely many values. Let 
B be a finite o-algebra such that B > f~!(Br). Then f can be written 
uniquely in the form 


i So alu, with a;€R, where the H; range over the atoms of B. 


PRooF. Let Ho,..., Hy be the atoms of B. Let € € f(X); then the hypoth- 
esis f~'(€) € B implies that f~1(€) can be written as a union of atoms. 
Hence f has constant value, say a;, on H;. The two sides of the identity co- 
incide on H; for every i, and since U;H; = X the identity holds everywhere. 
OD 


6.3.3 Corollary. The measurable indicator functions generate the vector 
space of simple functions. 


Proor. Let f be a simple function and let B = f—!(BR) C A. Then B is 
a finite o-algebra by 6.2.4. 0 


6.4 Approximation by simple functions 


6.4.1 Proposition. Let f € £°(X,A) be bounded. Then there exists a 
sequence of simple functions g, converging uniformly to f. 
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PROOF. Consider the half-open interval 
Jy = [kn7!, (k + 1)n73). 
We may write it as a countable union of closed sets in the following way: 
Jp = U fin (i+ (1 _ a) no] : 
q 


Hence J; is a Borel subset of R, and 1), € Lo CX, Br). 
Let C and ko be such that |f(a)| < C and 


+ko 
LJ a > [-C,+¢]. 
k=—ko 
Set 
Gn = S- ee Ge 
—ko<k<+ko 


Since the J, [-C,+C] form a partition of [-C,+C], we have t —n7! < 
Gn(t) < t if |e] < C. 
Moreover, G,, takes only finitely many values and 


Gn € M((R, Br), (R, Br)). 
Set gn = Gyo f; then g, € E°(X, A) and 


|Gn(z) =, f(z) < nto 


6.4.2 Corollary. Let f € £L°(X,A). Then there exists a sequence {pp} of 
simple functions converging pointwise to f. 


Proor. Let A, = {x : |f(x)| < n}. Then f, = 14,f is a bounded 
measurable function. Let y, be a simple function, constructed (as in 6.4.1) 
so that 

lfn(@) — Yn(x)| <n! for all z. 


Then 
lim y,(z) = f(z) Vee X.0 


6.4.3 Corollary. Let (X,A,) be a measure space and let (X, A’, y’) be 
its completion (in the sense of 4.2.3). Let f' € £L°(X, A’) be given. Then 
there exists f € L°(X, A) such that f(z) = f' (x) p-a.e. 

In particular, L}(X,A) can be identified with Lo, (X,A’). 


PROOF. Consider first the indicator function of a set A’ € A’. There exist B,Ce 
A such that B C A’ C C and u(C — B) = 0. In particular, lg = 14 p’-ae. 
Hence the corollary is true for A’-measurable simple functions. 
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Now let f’ € £°(X, A’) be given. By 6.4.2 there exist yi, € E°(X,A’) such 
that lim y{,(x) = f(x) Veex. By the argument above, there exist 6, € €°(X, A) 
such that A’, = {x : n(x) 4 yn (zx)} satisfies y’(Aj,) = 0. 

Let Aw = UnA},; note that A € A and p(Aw) = 0. Define yn(x) = Gn(z) if 
z ¢ Ax and yn(x) = 0 otherwise. Then yn € £°(X,A) and {yn(x)} converges 
for every x. Moreover, setting lim y, (x) = f(x), we see by 2.5.1 that f € £L°(X, A) 
and f'(x) = f(x) pae. O 


6.5 Integrable simple functions 


6.5.1 Definition. Simple functions f such that u({x : f(x) 4 0}) < +00 
are called integrable simple functions. We denote by € X , A) the integrable 
simple functions and by E/|(X, A) the equivalence classes in €°(X, A) gen- 
erated by the integrable simple functions. 

EN(X _ A) is a vector subspace of €°(X,.A) which is closed under multi- 
plication and absolute value. 


6.5.2 Definition of the integral on Gilx ,A) 


Let f € EL(X, A) be written in the unique form associated with the 
o-algebra f—!(Br), as in 6.3.2: 


f= So alu, (where a; # 0 Vi). 
A 


The integral of f is defined by the formula 


I(f)= dL oin(Hi). 


If fi € 0.6 A), fi = f ae., then it is easily verified that I(f) = I(f1). It 
follows that the function I(-) is defined on E/)(X, A). 


6.5.3 Lemma (Evaluating the integral on certain finite c-algebras). 
Let f be an integrable simple function and let B be a finite o-algebra such 
that B > f~!(Br). Denoting by K,, ..., K, the atoms of B, let 


f= > Balk, (where By #0 Yq) 
q 


be the decomposition of f given by 6.3.2. Then 


I(f) = So Baul Ka). 


PRooF. Let {H,} be the set of all atoms of f~'(Br). Since each H, is in 
B, H, can be written as a union of atoms of B: H, = U Ky, where I; 


‘ 4 qéls 
is a finite set. 
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On each K, (q¢ € Is), f = as; thus a, = 3, if qe I, and 


Y Aan(Ky) = OD oan) = Foo. Ta) 


q s qé€lI, s qels 


But (Hs) = Viger, H(A). O 


6.5.4 Theorem (Properties of the integral on simple functions). 


(i) The integral defines a positive linear functional on Ei (X, A). 

(ii) Setting q(f) = I(\f|) defines a norm on Ej(X, A). Moreover 
WAI < a0) 

(iit) wa : ie — falx)| > nh) < Gah — fe) (Chebyshev’s 
inequality). 

(iu) e(fi. fa) < 2q(fi - fe)? 

(v) Every Cauchy sequence in the normed space E} ulX,A) is a 
Cauchy sequence with respect to the distance of convergence in 
measure. Convergence in norm implies convergence in measure. 


PROOF OF (i). Let 7 be a constant. Then I(yf) = yI(f) for every f € 
Ej,(X, A). 

Now let fi, fo € EX, A). Let F be the mapping from X to R? defined 
by setting F(x) = (fi(2), fo(2)). 

Then Fisa Sipe mapping and F~!(Br2) = Bis a finite sub-o-algebra 
of A containing f; '(Br) (i = 1.2). The decomposition of f; on the o-alge- 
B gives 

fi=S081K, and fo= S$ 61K,, 


where the A’, range over the atoms of B. Then f, + f2 can be decomposed 
in the o-algebra B as fi + fo = S7(8s + 65)1K,, whence 


I(fitfa) = S5(8.+8s)e( Ks) = D> Bet Ka)+>— d5u(Ks) = LC fr) +1 (f2)- 


If f(x) > 0 pra.e., the only coefficients appearing in the sum are the non- 
negative G,. Thus 


(vi) f(x) = 0 pra.e. implies I(f) > 0 (positivity of the integral). 


PROOF OF (ii). By the positivity of the integral, the inequality |f + h| < 
|f| + |R| implies that ¢(f +h) < g(f) + 4(A). 

That q(af) = |alg(f) is trivial. It remains to show that q(f) > 0 and 
that ¢(f) =0 implies f = 0 p-a.e. 

The first inequality follows from the positivity of the integral. Moreover, 
in a o-algebra adapted to f, 


Tf) = So lada(e 


a; XO 
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and this sum of nonnegative terms can be zero only if all the terms are 
zero. 

Finally, —|f| < f < |f| implies the same inequality for the integrals: 
—I(\fl) < Tf) < TfI). 


PROOF OF (iii). We use the same finite o-algebra B as in the proof of 
(i) and the same decompositions of f; and fz on the atoms of B. Then 


fi-fo= (8s — 6s) 1K, and q(fi— fz) = ae ysu( Ks), where ys = |G5—6s|. 
u({x : |fi(x) — fo(x)| >n}) = So u(Ks), where J ={s: 7s >n}, 


sed 


afi — fo) > So ysu(Ks) > 1 D> (Ks) 


sed sed 


and 


It follows that 


ahi — fe) > me({@ = |fil@) — fe(@)| > n})- 


PROOF OF (iv). Consider the subset K(f1, fo) of (R*)?, which was used 
to define e( fi, f2): 


K (fi, fo) = {(e.m) : mM fi — fel > 7) < ef. 
Then, by (iii), 


(n~'a(fi — fo).n) € K(f, fe) for all 1 >0. 


Hence 
e(fi, fa) = inf(e +n) < inf(n +9 'a(fi — fa). 


Taking 7 = [q(fi — fo)|? shows that e(f1, f2) < 2[q(fi — f2)|?. 0 


PROOF OF (v). It follows immediately from (iv) that a Cauchy sequence in 
the normed space Ey, is a Cauchy sequence with respect to the distance of 
convergence in measure. Similarly, a sequence that converges to fo in norm 
also converges in measure. 0 


6.6 Some spaces of bounded measurable functions 


6.6.0 Definitions 


Le(X,A) = {f € £L°(X,A)): 43M < co such that |f(a)| <M}. 
Lye (X, A) {fe £°(X, A): w({e: f(x) #0)}) < co}. 


I 
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6.6.1 Proposition. For every f € £7 '(X,A), there exist pn € ABS A) 
such that 


(i) {yn} 4, uniformly to f, and 
(vt) {v: pn(a) #0} = {x: f(x) 4 0)}. 


PrRoor. Cf. Proposition 6.4.1. 0 


6.6.2 Proposition. If {y,} satisfies 6.6.1, then {I(p,)} is a Cauchy 


sequence. 


Proor. Let K = {x: f(x) #0)}. Then y,, = y,lx and 


(en - Ym) = I((en - ~Pm)1lK) < p(k) sup lyn — Pm — 0 


by the uniform convergence of {y,}. 


6.6.3 Definition. I(f) = lim I(yn) Vf € ae where {y,,} is the sequence 
of Proposition 6.6.1. 


This is independent of the choice of sequence. Let {y/, } be another sequence 
satisfying 6.6.1(i). Set 


Pin = Pm/2 if m is even, and 
gy, = Pim—1)/2 if m is odd. 
Then y/), satisfies 6.6.1(i) and hence lim /(y7,) exists. But this implies that 


m 


lim J(y,) = lim I(¢y,). 


6.6.4 Proposition. Let f € ve lak Then the following statements are true: 


(i) Whi + fe) = 1h) + 1p). 
(i) fizth= Thi). > T( fa). 
(ii) fy = fo ae. = I(f,;) =I (fo). 


6.7 The truncation operator 


For a fixed positive integer n, let y, be the continuous function defined on 
R by 

Pn(t) = if -n<t<4+y%7 
Pal = if t>n 

Pr(t) = if t<—n. 


Let A; C Ag... C Ay... be an exhaustion of X, i.e. (Ap) < +oo Vk 
and X = U; Ap. 
We define T,,, the truncation operator of order n on LUX , A), as follows: 


6.7.1 Tr(f) = fnla,, where fn =%nof. 
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fn is bounded and (since y, is continuous) measurable. Furthermore, 
since the set {x : (Tn f)(x) #0} C An, it has finite measure. Hence, by the 
definition of L7°", 


6.7.2 Tn(f) € £2°"(X,A) for any f € £L°(X, A). 


6.8 Construction of L! 

6.8.1 Definition of L/,(X, A) 

(i) Definition. £1,(X, A) = {f € £°(X, A) : limn oo I(|Tn(f)|) < +00}. 
Proposition. If f; € gi, and fz = fi a.e., then fo € LX, A). This 
justifies the notation 


Dh X, A) = {equivalence classes of LAX A)}. 


(it) filo. = lim I(\Tp(f)I)- 
(iii) If f € L° and |f| < |h|, where h € Li, then f € Lj. 


(iv) If f € L(X, A), then f € L)(X, A). 


6.8.2 Proposition. If f € L/,, then limn+.I(TIn(f)) exists. 
ProoF. Let f* =sup(f,0) and let f~ = sup(—f,0). Although T;, is not a 
linear operator, it is elementary to verify that, for all x € X, 
Tn(f)() = Tr(f*)(@) — Tr(f7) (2) 
and 
ITn(f)| = Tn(f*) + Tr(f7), 
whence ee a: 
U(Ti(F*)) < LTaCA))) < Wf llee- 


{I(Tn(f*))} is thus an increasing sequence which is bounded above, and 
therefore converges. 0 


Definition. For f € L/,, the integral of f is defined by / f =limI(T,(f)). 


6.8.3 Proposition. Li is a vector space with the following properties: 


i) ffith)y=SfAt+f fe 
(ii) If f >0, then f f > 0. 


Set ||flln = ff]. Then 


(wi) | f fl <\lflln- 
(iv) w({x : f(x) > eh) < cllflln- 


(v) \|fllz1 is @ norm. 


PROOF. The statements clearly hold for Le and pass to Die Oo 
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7 Theorems on Passage to the Limit 
under the Integral Sign 


7.1 Fatou-Beppo Levi theorem. Let {f;,} be an increasing sequence of 
integrable functions such that [ fy, <C, where C is a constant independent 
of n. Then 


(i) lim fr = fx exists and is finite p-a.e., 
(it) Foo € Li, and 
(tit) \|fn — foolln: — 0. 


PROOF. By setting i= fn — fi, we may assume that f, > 0. Then a= = 
follta, Ty(foo) = limn Ty(fn), and fT)(fn) < f fa < C. It follows that 
|Ta( foot: < C, whence veins (Ty(foo))(x) > n}) < Cn7!. Furthermore, 


{x : foo() > n} = lim T {n: (Ty(foo)) (a) > n}. 
Thus 


= 


H({@: foo(x) > n}) < 


Hence fx. € L'. 
We now show that || fs. — fnilp: — 0. Let un = foo — fn. Then 


and ||Tq(foo) lz SC. 


T(t) F Tyo (1) 2 Ty (tn) ~~ Tao (Un), where go <4. 


Let go be chosen so that f T)(ui) — {Ty,(ui) < §. Then 
€ 
Ilun || < 2 TP Zo (un) Ilo- 


Let vn = Ty, (un). Then 0 < un < qo, Un(x) =O if x € Ajo: and Un — 0 
a.e. Recall, from 6.7, that ju(Aq,) < +00. 


By Egoroff’s theorem, there exists K such that u(K‘°) < 7 and Un 
qo 


converges uniformly to zero on K. Hence 


€  € 

lJun|| <= +--+ u(Ag,) sup(vn(z)) —>0 as n> 00.0 
2 4 ce k 

7.2 Lebesgue’s theorem on series. Let {u,}°, be a sequence of el- 

ements of L' such that > |\unllz1 < co. Then 37° un converges abso- 

lutely a.e. Let 8) = uy +...+ Uy, and let s,, = lim, s,. Then s, € L, 

J 800 =lim f sn, and ||850 — 8n||z1 — 0. 


PRooF. Set f,(z) = S7p_, |ux(x)|. Then {f,} is an increasing sequence 
and f fn < 2725 |luslln: < +00. By the theorem of Fatou-Beppo Levi, this 
implies that lim f, = foo exists, foo € Lt, and fx, < +00 a.e. Thus 54, € L! 
since |s..| < foc, and ||Sa5 — Snllz. < || foc — fallz:, which approaches zero 
by Fatou-Beppo Levi. 
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7.3 Proposition. The truncation operator is a contraction on Ls, A); 
that is, 


Inf) —Ta(Alnr SUF - fll, WF © LAX, A). 


Proor. Assume first that f and f are simple functions. Let B be the 
g-algebra generated by f~'(Br), f~'(Bp), and {An}, and let S denote 
the atoms of B. Then 


f= )lonly and f= > ofa where Hz ES. 


Let 1 = {H €S:HNA, #0}. Then 


Tn(f) = Soen(an)Lan: 


kel 


TF) = Soen(Gn)den, 


kel 
ITn(f) — Ta(f)| So len(ar) — en (Ge) lL. 


kel 


I 


Using the elementary inequality |yn(t) — yr(t’)| < jé-t’|, Vt, eR, 


ITn(f)—Tn (Poller < 3° eA )len— Gel < SO w(Ax)lox—&x| = [Lf — files 


kel keS 


Now let f and f € L'. We can find two sequences hg, hq of simple 
functions converging in the L! norm to f and 3 Passing if necessary to a 
subsequence, we may suppose in addition that hg and hg converge a.e. Then 
Tn (hg) — TIn(Rg)Ilz1 < [|g — Aalle; hence T;,(h,) is a Cauchy sequence in 
the L! norm. Let k be its limit. Then k = T,,(f) since h, converges a.e. to 
f, and hence 


|Tn(Rg) — Tn(f)llz1 — 0. 
It follows that 
Tn(F)—Tn (Alles = mm Ty (%4q)—Tr (He) ler $ Yim [bq —Fegllas = Fle 


a) 


7.4  Integrability criteria 


7.4.1 Theorem. Let f € L)(X,A). Then f € Li(X,A) if and only if 
there exists a constant C such that, for all n, ||T,(f) lb. < C. 


PROOF. (<) Applying 7.3 with f =0 yields ||Tn(f) lz. < ||fllz- 
(=) We prove this first in the special case that f > 0, where Tn(f) < Tn4i(f). 
By the Fatou-Beppo Levi theorem, there exists g € L’ such that lim Tn(f) = g 
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a.e. Moreover, a direct calculation shows that lim T,,(f)(x) = f(z) for allz € X. 
Hence f = g, and therefore f € L’. 

For the general case, set ft = sup(f, 0) and f~ =sup(—f,0). Then ft, f~ € 
L°, f+, f~ are positive, and f = f* 

Since |Tn(f)| = Tn(f*) + Trl f7) (cf the proof of Proposition 6.8.2), 


(Tell = fron = frcry+ fr 


It follows that 

ITn(f* Ins SC and ||Tn(f~ Ina < C. 
Since f* and f~ are nonnegative, this implies that f* and f~ © L! and hence 
that f € L'. 


7.4.2 Corollary. Let f € L)}(X,A). Then f € LIX, A) if and only if 

[fl € L'(X, A). 

PRooF. The direct implication follows from 2.4.4 and 6.8.1(iii). 
Conversely, assume that |f| € LIX, A). It is easy to see that |T;,(f)| = 


Tr(l fl), whence |/7,(f)|lz. = (TnCFf])llz:- The conclusion follows by ap- 
plying Theorem 7.4.1. 0 


7.4.3 Corollary. Let f € L°(X,A) and suppose that there exists u € 
L},(X, A) such that |f| <u. Then f EL) (X, A). 


ProoF. |[Tr(A)llz < Tn(lla < [ules 9 


7.5 Definition of the integral on a measurable set 


Let (X,.A,,.) be a measure space and let Y be a fixed element of A. We 
denote by A’ the trace on Y of the o-algebra A and by p’ the restriction 
of 4 to the elements of A’, thus obtaining a measure space (Y, A’, ’). Let 
j be the canonical injection of Y into X. The restriction operator defines 
a mapping L)(X,A) > L),(Y,A’) by f > foj. 

Let f € Li, MX, A, 1). We denote by J, f the integral of f o 7 evaluated 
on the measure space (Y, A’, 4’), and call aes f the integral of f on Y. 


7.5.1 Proposition. Let f ¢ L/,(X,A). Then fly € Li,(X,A) and f fly 
=Sy fF 

PROOF. Since |f1y| < |f], Corollary 7.4.3 implies that fly € L}(X, A). 
The result follows by verifying that the integrals agree on simple functions 
and passing to the limit.0 


7.5.2 Proposition. Let f >0, f € Li,(X,A), and set p(A) =f, f VAE 
A. Then p is a measure on X and p(X) < +00. 


PROOF. Finite additivity follows from the fact that 


14, +14, = lava, if A,N Ag =9@. 
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The theorem of Fatou-Beppo Levy implies that p is continuous on in- 
creasing sequences; this gives countable additivity. 0 


7.5.3 Proposition. Let A, be an increasing sequence of elements of A 
such that UAn = X. Let f € Li(X,A). Suppose that J, |f| ts bounded 
above by a constant C independent of n. Then f € Lj,(X, A). 


PrRooF. Since ||Tn(f)|lz1 < J, [f|, the result follows from 7.4.1. 0 


7.6 Lebesgue’s dominated convergence theorem 
Theorem. Let f, € Lj,(X,A). Suppose that 

(t) fn converges to h p-a.e. 
and that 

(ii) Ag € Lj,(X,A) such that |fr| < g Vn (domination hypothesis). 
Then he L', 


(iit) Ilfn — Allz: > 0, 


and 


(iv) [ao fe 


PrRooF. It follows from 5.1.2 that h € L°(X, A). By (ii) and 7.4.3, h € L!. 
As in 7.5.2, we introduce the measure p associated with g: 


Let {A,,} be an exhaustion sequence for X: A, C An4i and p(A,) < +00. 
Then p(An) — p(X) < +00. Fix m such that 


€ 
plAS,) < 5. 
For this fixed m, we will apply Egoroff’s theorem (5.1.4) to A,,. We can 
find a sequence {K,} of sets in A such that Ky C Kg41, fn > fo uniformly 
on Ky, and u(KEN Am) <qv!. 
Set Gg = KjNAm. Then {G;} is a decreasing sequence; setting H = G4, 
we have lim p(G,) = p(H). But u(H) = 0, whence g-1y = 0 p-a.e.; ie., 
g-1y =0 in L}, and p(H) = 0. Fix qo such that 


€ 
p(Gao) < §- 


The identity 
ix = dace, Fix, + 1e;, 
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[it = h| =| +f +{ lfn an! hj. 
AS. Rag Gay 


Using the upper bound 2g for the function |f,, — A] in the first and last 
integrals, we obtain 


gives 


Ifa — Alls < 2pCAS,) + 2p(Gay) + | Ifa — Al. 


Ky 
Each of the first two terms is bounded above by ¢/3. Furthermore, 


[ u-as ( sup [falz) a) HK). 


rE Ra, 


The last term tends to zero as n — +00, proving (iii). Finally, (iv) follows 
from the continuity of the integral with respect to the norm || - |;: (cf. 
6.8.3(ii)). 


7.7 Fatou’s lemma. Let f,, € L/,(X,A). Suppose that 


(i) \\fnller < C. where C is a constant independent of n, and 
(it) fr converges t-a.e. to h. 


Then 

(iti) hE L! and |All: < C. 

PROOF. We prove this first with the additional hypothesis 
(iv) U(X) < +00. 


In this case, convergence a.e. implies by Egoroff’s theorem that, for every 
integer g > 0, there exists K, C X such that f, converges uniformly on 
K, to hand p(K4) < e Thus 


fo imi- fim 


q 


< (Ky) sup | fn(2) — h(x)], 


Since f,,(2) converges uniformly to h(x) on Kg, the last expression tends 


to zero, whence 
/ lA, <C. 
- 


. q 


Set hy = |h|-1x,. Then {hg} is an increasing sequence since Ky C Ky41, 
and the Fatou-Beppo Levi theorem implies that 


lim hg =ho € L' and |Pollz, <0. 
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If X does not have finite measure, take an exhaustion sequence for X: 
X= UA,, Ar ie Area, L(A,) < +00. 


For each fixed r, set fp = fn1a,; then ||fri|z1 < C. Fatou’s lemma for finite 
measures can be applied to A;, giving 


hv =limf? =hla, €L' and |fh"Ilp1 <C. 


The conclusion follows by applying the Fatou-Beppo Levi theorem to the in- 
creasing sequence K, = |h|1a,. 0 


7.8 Applications of the dominated convergence theorem 
to integrals which depend on a parameter 


7.8.1 Integral notation in which the measure sz appears 


Up to now, we have dealt only with functions defined on the measure 
space (X,A, 4). When we consider functions defined on different spaces, 
the integral notation used earlier can lead to confusion, and we denote 


i by [ fe rau for all SE L)(X,A). 


7.8.2 Integrals depending on a parameter 


Let (X, A, 1) be a complete measure space. Consider a metric space Y and 
let 


u(y) = , k(0, yale) 


be an integral depending on the parameter y. Suppose that 


(i) for each fixed y the function ky(a~) = k(a,y) satisfies ky € 
LK A). 


Then u(y) is a well-defined function for every y. 


7.8.3 Proposition (Continuity of an integral depending on a pa- 
rameter). Assume condition (i) of 7.8.2. Let yo € Y and assume in addi- 
tion that 


(tt) for every sequence Yn — Yo, 
k(2,yn) > k(x, yo) p-ae.; and 
(iii) there exist g € Lj,(X,A) and «> 0 such that 
Ik(z,y)| Sg(t) if d(y,yo) <e. 


Then the function u is continuous at yo. 
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PROOF. Since uw is defined on a metric space, in order to show continuity at 
y it suffices to prove that u(y,) — u(yo) for every sequence {y,,} converging 
to yo. Set fr(x) = k(x, yn). Then the dominated convergence theorem (7.6) 


can be applied and 
[fe pats | tw 


7.8.4 Proposition (Differentiability of an integral depending on a 
parameter). Let Y = (yo—€, yo +e) be an open interval in R, and suppose 
that the following three conditions hold: 


(i) 7.8.2(i) is satisfied Vy € Y. 
(ii) For -almost every x, ACE yo) exists Vy € Y and is continuous 
at yo as a function of y. 
(iti) 3g € L),(X.A) such that, for p-almost every x, [SE (a, y)| < 
g(x) for everyy EY. 


Then u is differentiable at yo and 

‘ Ok 

(iv) u' (yo) a4 By (o yo) du(a). 
x OY 


PROOF. In order to show that wu is differentiable, we must show that there 
exists 1 such that 
lim e~'[u(yo + €) —u(yo)] =. 


Since R is a metric space it suffices to show that there exists / such that, 
for every sequence {€,,} tending to zero, 


lim 7, *[u(yo + €n) — u(yo)] = 1. 


Making this detour lets us apply Lebesgue’s theorem, which was stated 
for sequences of functions. Fixing the sequence {e,,}, set 


cx lu(ao + &») —ulao)] =f Suleddutay, 


where 
fr(z) = 6, [k(a, Yo a €) = k(a, yo)]- 


Let K be the negligible set such that (ii) and (iii) are satisfied in K°. 
Then, for 7 € K°, fy, can be calculated using the mean value theorem: 


file) = see. 40 +6,(x)), where |@,(2)|<e, if we K°. 
y 
Thus it follows from (ii) that 


ak | ; 
fala) > (eo) if 2 eK 


8 Product Measures and the Fubini-Lebesgue Theorem Al 


Furthermore, by (iii), |fn(x)| < g(x), x € K°; thus 


Mfa(a)] <o(2) ae. and tim fy(x) = F(e,y0) ac 


Applying the dominated convergence theorem gives 


[ soterante) > fF @v)au(z).0 


8 Product Measures 
and the Fubini-Lebesgue Theorem 


8.1 Definition of the product measure 


Let (X41, Ai, ui) and (X2, Ao, 2) be measure spaces, let X = X, x X2 be 
the product space, and let A = A; ® Ag be the product o-algebra (see 
1.5). The product measure is a measure pz defined on the measurable space 
(X, A) and satisfying 


(i) (Ai x Az) = wi (Ai)u2(A2) if pi(Ai) < +00 (¢= 1,2). 


8.2 Proposition (Uniqueness). There exists at most one product mea- 
sure. 


PROOF. Let yz and jz be two measures satisfying 8.1(i). Then they coincide 
on rectangles and hence, by finite additivity, on disjoint unions of rectan- 
gles, that is on the Boolean algebra € of elementary sets. Let 


M={ZEA:p(Z)=fi(Z)}; then MDE. 
Let {Z,,} be an increasing sequence of sets in M. Then, by 3.2.1, 
M(UnZn) = lim p(Zp) = lim (Zp) = f(UZn). 


Thus M is closed under increasing limits. 
If we further assume that 


(2) ba(X1) <-+oo and [2( Xe) < +00, 


then 
w(X) = pn (X1)H2(X2) < +00. 
3.2.3 can be applied to prove that M is closed under decreasing limits. 


Hence M is a monotone class that contains €, and it follows by 1.5.5 that 
M = A, ® Az. 
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To complete the proof, it remains to lift the restriction (i). Let {Y,,} and {Z,} 
be exhaustions of X, and X2, and let pn and pin denote the restrictions of 4 and 
p to ¥, and Z,. Then, by the result above, 


Mn = Lin . 


Furthermore, Un(Yn X Zn) < +00 and Un(Yn x Z,) = X. Thus Y, x Z, is an 
exhaustion of X with respect to both y and yp. By 3.2.1, for all AE A 


u(A) = lim pin(AN (Xn x Zn)) 


and 


ji A) = lim fin(A (Xn x Zn): 


Since the two right-hand sides are equal, (A) = (A). 


Sections 


For fixed 21, let 2z, denote the injection of X2 into X defined by rg > 
(21,@2). For Z € P(X), let Z,, = iz/(Z). Zp, is called the section of 
Z over x. Letting a; be the projection of X onto X;, we have Z,, = 
m2(m,' (x1) NZ). 


8.3 Fundamental lemma. Let A € A= A, @ Az. Then 


(i) Ay, € Ag Var € X41. 
(it) Suppose that jig(X2) < +00 and set k4(x1) = Me(Az,). 


Then 
(iit) ka € L°((X1, A1)) VAEA. 


PROOF. Since A is generated by the rectangles R, Theorem 2.2.1 implies 
that i7'(A)z, is generated by {i;'(R)}. But {i7'(R)} = Ag; since Ag is a 
a-algebra, it coincides with the o-algebra it generates, whence (i). Let 


M = {B € A: kp(21) is a measurable function of x;}. 


The rectangles are in M, as are finite unions of disjoint rectangles; thus 
the Boolean algebra of elementary sets is contained in M. We now show 
that M is a monotone class. 

Let B, be an increasing sequence of elements of M. By the limit theo- 
rem (3.2.1), ka, (1) = o((Bn)x,) satisfies limkg, (71) = ke. (41), where 
Bs. = UB,. Hence k4,. (x1) is measurable with respect to x; by 2.5, which 
implies that A, € M. 

Since t2(X2) < +00, Theorem 3.2.3 on the limits of decreasing sequences 
can also be applied, and it follows that M is a monotone class. Since M 
contains the Boolean algebra of elementary sets, M = A by 1.5.5. 
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8.4 Construction of the product measure 

8.4.1 Theorem. Let (X1,A1, p41) and (X2, Ag, u2) be measure spaces. 
(i) Suppose that ju;(X1) < +00 and u2(X2) < +00. 

For every A€ A= A; ® Ao, set 


(ii) p(A) = fy, ka(ei)dyi(a1) where ka(x1) = pe2(Ar,)- 
(p(A) ts well defined by Lemma 8.3.) 
(itt) Then p ts a measure on A, of total mass 41(X1)pU2(X2) < +00. 


Moreover, 
(iv) p(Ay x Az) = 111 (Ay) u2(Aa) if A; € Aj. 


PROOF. Since p is a finite measure, it suffices to prove that the o-additivity 
axiom is satisfied. We begin by proving finite additivity. Suppose that 


A=A'UA"” and A’N A” =O. 
Then A’, 9 Af, = 0, whence ka: (x1) + kan (x1) = ka(a1) and 
p(A) = p(A’) + p(A”). 


Now let A? c A?t+! Cc... be an increasing sequence of elements of A. Set 
A® = UAP; then lim f (A?,) = (A™)z, and, by 3.2.1, k4»(x1) > kax (21) 
for all z;. Next, kav < k4p+1. Applying Theorem 7.1, the theorem of Fatou- 
Beppo Levi, 


tim f eae aa)dya (2) = [bax (x1)dui(@1), ie. lim p(A?) = p(lim A”). 


This property, together with finite additivity and 3.2.4, gives o-additivity; 
hence p is a measure. It is trivial to see that (iv) is satisfied. 0 
8.4.2 Theorem on reversing the order of integration 


Theorem. Let (X1, Ai, 41), (X2, Ao, p12) be measure spaces. Suppose that 
U1 (X) < +00 and pi2(X) < +00. Then, if A € A; ® Ag, 


a di (x1) fs Lalo, t2)dua(s)| 


= i. dji2(x2) fe La(e1.z2}din (2). 


PROOF. Although the hypotheses in 8.4.1 are symmetric in X; and Xo, the 
construction is not. 

Set la(r2) = i(Az,). Then o(A) = f la(x2)du2(r2) exists and defines 
a product measure by 8.4.1. By 8.2, 0(A) = p(A) VAE A. O 
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NOTATION. The product measure is denoted by p41 @ 2. By definition, for 
all A € A; @ Ag, 


/ 14(x)d(p41 © p2)(2) 
x 


[ancy | 14(@1, 2)dp2(x2) 
8.4.3 es Xe 


dpi2(r2) | L(x, U2)dpy (21). 
Xo Xi 


8.4.4 Construction of the product measure in the general case 


If yz; and pg are not finite measures, let X{ and XJ be exhaustions of X, 
and X2. Set w? = Lxni. Then yp? (Xi) < +00, i = 1,2. We can define 
pt ® wy and set 


(141 ® Ha2)(A) = lim(ut ® #z)(A)- 


8.5 The Fubini-Lebesgue theorem 


Theorem. Suppose that (X1,A1, 11) and (X2, Az, 2) are measure spaces. 
Set X = Xx Xo, A= A, @ Ap, and pp = [1 ® pla, and let (X,A,p) be the 
product measure space. Suppose that 


(2) fel: 
(ii) Then fz, : 22? f(t1,%2) satisfies fr, € L°(Xo, Az) Va, € X1. 


Now suppose that 
(iii) f € Li (X, A). 
Then the following two properties are satisfied: 


fe, € Lj, (X2,A2) pi-a.e. in 1, and 
(iv) ke Li (X1,A1), where bes) = | f (a1, %2)dji2(z2). 
X2 


; [ tlevreidutenny = fainter if. Hlerz)dun(ea) 


( = i dji2(x2) fe Flas .a)dua(er). 


Conversely: 


Suppose that (i) holds, fr, € Lj,,(X2,A2) M1-ae., 
and there exists k* € L}, (X1,A1) such that 


(vi) 
ip LF (1, 22) dn(t2) < k*(01). 


Then (iti) is satisfied, and hence (iv) and (v). 
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REMARK. Denote the functions satisfying (ii) by Q and the functions sat- 
isfying (ii), (iii), (iv), and (v) by R. Then Q and R are vector spaces. Since 
the indicator functions of measurable sets are in Q by 8.3, so are finite 
linear combinations of indicator functions: E(X,A) Cc Q. 


PROOF. First assume the following stronger hypothesis: 
(2)’ f €£°(X,A) and f is bounded. 


Then, by 6.4.1, f is the uniform limit of a sequence of simple functions y”: 
er =D labday. 
By the remark, y” € Q for each n; that is, 


(y”)zx1 € L°(X2, Ad), Va, € X. 


Since (f)z, = lim(¢n)z,, 2.5.1 shows that (i)’ > (ii). 
Similarly, using 6.8.1(iv), hypothesis (iii) can be replaced by this stronger hy- 
pothesis: 


(iit)’ —f satisfies (i) and {x : f(z) #0} C Ai x Az, with i(Ai) < +00. 


Let {y~"} be a sequence of simple functions which converge uniformly to f and 
for which yp" (x) = 0 if c ¢ A; x Az. Then »” satisfies (iv) and (v). 
Since y” — f uniformly, there exists a sequence {e,,} such that en | 0 and 


lf — "| < €nlaixAa- 


/ I eMltn sen f dj = €np2(A2). 
x A, x Ag 


Thus 


Similarly, 


i: ler, = fe, |du2 x en f du2(x2) = €nf2(A2), 
X2 


Ag 


whence f Yr, 42 converges uniformly to f fe, dz. It follows from 2.5.3 that the 
left-hand side of the formula in (iv) is measurable. Repeating the same argument 
a third time for the integration in x; gives (iv) and (v). Summarizing, we have 
shown that (iii)’ = (v). 

Let {A[} and {A$} be exhaustion sequences for X; and Xz. Then {A?} = 
{A} x A$} is an exhaustion sequence for X. Let T, be the truncation operator 
defined in 6.7. Then T,(f) satisfies (iii)’. 

Suppose now that (iii) holds and that 


(vii) f>0. 


{Tp(f)} is an increasing sequence of functions in L}, and ||Tp(f)|lz1 < |Ifllz.- 
Since T,(f) satisfies (iii)’, (v) holds and 


| CaaS a Ghee RGR / (Ty(f))nrdur. 
X1 X2 
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As the sequence {k,} is increasing, the Fatou-Beppo Levi theorem (7.1) applied 
to X, shows that 


lim kp = ke € Lj,,(X1,A1) and / koe = lim ||Tpf ilo. = || Fl. 
x 


ee 


where the limit of the kp(1) is finite if x; ¢ B for some B € Aj, ju (B) = 0. 
Fix 21 ¢ B and apply Fatou-Beppo Levi on the space X2: 


hoo(1) = lim | (Tarde =f tim(Tp())n.di2 = [nda 
Xo Xo Xo 
We have thus proved (iv) and (v) when f satisfies both (iii) and (vii). If (vii) 
is not satisfied, write f = f* — f~; then f*, f~ © R, and by the remark f € R. 
It remains to prove the converse. Letting f satisfy (i), set f! = |f|. Using the 
truncation operator T,, we have 


[OP rede < [sereadldyatea < k* (2x2). 


Moreover, since T, ft € Li, we may use the identity (v) to obtain 


[etaem fan for neva as [Cardston 
x 


Hence the norm of T,(f') is bounded, with a bound independent of p, and the 
integrability criterion 7.4.1 implies that f! € Lj}. Since f € L°(X, A), 7.4.3 
implies (iii). 
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9.0 Integration of complex-valued functions 


Let f(z) = u(x) + iv(x) be a complex-valued function. Then f is a mea- 
surable mapping from X to C if and only if u and v are measurable. 
Furthermore, we say that f is integrable if u and v are integrable, and set 


(i [tq forife 


The integral f + f f is a C-linear functional on the space L1(X, A, u;C) 
of complex-valued integrable functions. Moreover, setting 


Zy = {we X: f(x) £0)}, 


Z; € Aand the function arg f(x) is well defined for x € Zs. The argument 
is defined to be zero on Z. Thus, if f € M,(X,A;C), we can write 


(it) f(x) = w(x, 
where w € M,(X,A;R*), 6 € M,(X. A; (0, 27)), and |f(a)| = w(2). 
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(iii) Lemma. Let f be a complex-valued integrable function. Then |f| is 


integrable and 
[fas fin 


ProoF. |f| < |u| + |v| and is thus dominated by two integrable functions, 
hence integrable. Set 
/ f=re'®; 


[il-o [r= fre 


Using the decomposition (ii), 


then 


| [tf = Ref wlaseos(a(a) ~ edula, 


Since |cos(@ — y)| < 1, we obtain 


| / i < f w(e)dn(o).0 


NOTATION. The complex-valued integrable functions will be denoted by 
EX, A;C). 
9.1 Definition. Let (X, A, 4) be a measure space. Let p be a real number, 
1l<p< +o. 
Let 
_ 0 : 1 
Thx, A) =, {f € D(X, A) : |f |? € L,(X, A)}. 


isle = (| yr) 


It is clear that ||f||z» = 0 implies f = 0 and that Jlal|z» = fal ||f\lz» for 
every constant a. 

Complex-valued functions with integrable pth power can be defined sim- 
ilarly: 


Set 


LP(X, A; C) = {f © M,(X,A;C) : |f|? € D4}. 


Writing f = u+iv or f = we’, we obtain the equivalences 


FELIX, AC) ue Li(X,A) and ve Li(xX,A) 
f € LE(X,A;C) owe LR(X,A) and 6€ L)(X,A). 
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9.2 Convexity inequalities 


9.2.0 This section is devoted to proving the inequalities of Holder and 
Minkowski. When p = 2 these inequalities become very easy. (Cf. Exer- 
cises, Cauchy-Schwarz inequality. ) 


9.2.1 Definition. A continuous function y defined on [a, 6] C R is called 
conver if y', (x) = limeo(y(z + €) — y(2))e7! exists Vx € [a, b) and ¢’, (x) 
is an increasing function. In particular, if y is twice differentiable, then y 
is convex if and only if y” > 0. 


9.2.2 Lemma (Jensen’s inequality). Let y be a convex function on 
[a,b] CR. Let a, (1 < k <n) be positive numbers such that Sy ax = 1. 


Then . 
yp (>: ots) < So axy(te) Vth € [a, 6]. 
k=1 k=1 


REMARK. This inequality may be taken as a definition of convex functions. 


PROOF. We prove the lemma for the case n = 2. Let a and b be constants 
and set 
p(t) = p(t) + at + b. 
Then ¢ is convex. Choose a and 6 so that (ti) = 9(t2) = 0. Jensen’s 
inequality reduces to showing that 


p(t) <0 for t) <t< tg. 


Otherwise the maximum of ¢ would be strictly positive and would be at- 
tained at a point t3 € (ti,t2), and we would have 


Py(t3)=0, P(ts) > 0. 

Since ¢', is increasing, ¢',(t) > 9'.(t3) = 0 if t € [tz, ta), whence Y(t2) > 
(tz), a contradiction. We proceed by induction on n. Assuming that the 
inequality holds for n < p, we prove it forn =p+1. 

Set € = 8-1 (57? _, ajt;), where B = S7P_, ay. 

Then, by the result for n = 2, p(B€+ap+itp1) < Be(E) + Op419(tp41)- 

The first term on the right-hand side can be bounded above by using the 
induction hypothesis, which gives y(€) < 7?_, G-'aiy(t:). 0 
9.2.3 Corollary. Let €,, £2 > 0 and let a, 8 > 0 satisfya+@=1. Then 
ERE? < aks + Bba. 


Proor. If €; = 0, the left-hand side is zero and the inequality is obvious. 
Suppose that €; > 0 (i = 1,2), and set 7; = log €;. The exponential function 
exp(t) satisfies the hypotheses of 9.2.1, whence 


exp(am + Bn2) < aexp(m) + Bexp(n2).0 


9.2.4 Lemma. Let (X,.A, 14) be a measure space, let a, 8 > 0 be such that 
a+ @8=1, and let f and g be nonnegative functions in EX A). Then 
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(i) f%g® € Li(X, A) and 
(is) f fe98 < (ff)* (f9)”. 


ProorF. If f = 0 a.e., both sides of the inequality (ii) are zero. Hence we 
may assume that || f||z1, > 0 and ||g||z1, > 0. Setting 


f=Wfiitft, 9 =llglizts. 
we reduce the proof of (ii) to showing that 
[Pes 
We will use 9.2.3. For every x, f%(x)g9(x) < af (x) + 89(z). 


The right-hand side is an integrable function; hence (i) follows from 7.4.3. 
Integrating both sides of this inequality gives 


[resafi+e fa 


[fie sa+=10 


Since f f = f7=1, 


9.2.5 Definition of conjugate exponents 


Definition. Let 1 < p < +00 and 1 < gq < +00. We say that p and q are 
conjugate exponents if 


REMARKS. p is conjugate to itself if and only if p = 2. 
Ifl<p< 2, then q> 2. 


9.2.6 Theorem (H6lder’s inequality). Let (X, A, 4) be a measure space, 
let p and q be conjugate exponents, and let f € L?, g € LY. Then 


(i) fg € L* and 
(ii) |f fol <(lFllzellgilce- 


PROOF. Since the theorem is clear when p = oo or q = 00, we may assume 
that 1 < p < oo. We first consider the case where f and g are nonnegative. 
Setu=f?,v=gl,a= - B= a Then fg = u%v%, and applying 9.2.3 
gives the theorem. 
In the general case, set |f| = fi, |g| = gi- Then fig: € L! by the 
argument above; hence by 7.4.2 fg € L! and 


[1 < f fio. 
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9.2.7 Theorem (Minkowski’s inequality). Let (X,A,,) be a measure 
space and let f, g € L?, where 1 <p<+oo. Then 


(i) (f +9) € L” and 
(ii) \|f + glee < Wf lle» + Ig 


LP: 


PROOF. The theorem is true for p = 1 by Proposition 6.8.3. Note that the 
function y(t) = t? is convex on [0, +00). Using Jensen’s inequality, we have 


eae) Datla) 
2s ives a 
( 2 S58 + 58: 


whence 
f(x) + g(a)|? < (\F(@)| + lg(@)|)? < 2°-7| F(a) |? + 2°" |g(a)”. 


Hence the integrability criterion 7.4.3 implies (i). It suffices to prove (ii) in the 
case that f and g > 0. We then have 


fo igi = fru grt pos $9). 


Letting gq be the conjugate exponent and using Holder, 


l/p 1/q 
[itror Le) (fr+or) 


but, since p and q are conjugate, p+q = pq, or (p—1)q = p. Writing the analogous 
integral for g, we obtain 


furans |(f2)" fo)" foro)” 


If + 9llz> < (flle» + llgllze IF + gilt 


or 


If ||f + gllz» = 0, Minkowski’s inequality holds trivially. Otherwise we can 
divide both sides by || f + gles to obtain 


If + 9len?!* < Wf lle + Ilglize, 


and the conjugacy relation gives p 
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REMARK. Writing f(r) = w(x)e™ shows that the Hélder and Minkowski 
inequalities remain true for complex-valued functions. 


9.2.8 Theorem. Let (X,A, 4) be a measure space and let 1 < p < +00. 
Then L?(X, A, ) is a vector space on which a norm is defined by the func- 
tion f > \|filze. 


PrRooF. It follows from 9.2.7(i) that L? is a vector space. Moreover, 9.2.7(ii) 
and 9.1 show that || - ||z» is a norm. 


9.3 Completeness theorem. Let (X,A,) be a measure space and let 
1<p<_-+too. Then Li(X, A) is a complete normed space. 


REMARK. For p = 1, Lebesgue’s theorem (7.2) implies that every normally 
convergent series in L! is convergent, and hence that L! is complete. 


PROOF. We proceed as in 6.5.4(v) by proving the following lemma: 


9.3.1 Lemma. Let {f,,} be a Cauchy sequence in L?.. Then {fr} converges 
in measure. 


PROOF. Fixing e, set 
{2 : lfn(x) = fri (x)| - e} = Ann’: 


Then 
/ fa — furl? > i. FPS As), 


sn! 


implying the Chebyshev-type inequality 
U(An,n’) pe oe | fellee 
Fix ng such that ||fn — fn'llzp < t+?” if n,n’ > no. It follows that 
e(fns fn’) < 2€ ifn,n’ > no. O 


9.3.2 PROOF OF THE THEOREM. Since L/ is a complete space, {fn} con- 
verges in measure (by 5.2.9) to fo. By 5.2.7, we can extract a subsequence 
such that 


(i) fn, converges to fp p-a.e. 
Since f, is a Cauchy sequence in L?, we have 
(ii) [Fnllze < C, or | |fnl? Inn < C. 


By Fatou’s lemma (7.7), |fo|? € L. 
Fixing k, consider the sequence {us} = {|fn, — fn, |?}. Fatou’s lemma 
can be applied since u, converges a.e. to |fo — fn, |?. We obtain 


|| lim ws||z1 < sup ||us|[r1- 
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Fix mo such that ||fn — fa|lze < € if n,n’ > mo. Take k such that 
Nz > Mo; then 
fo —fnyllne <e€ if n>mo 


and 
fo — fallze < Wlfo — frgllee + fn, — filer < 2€.0 


REMARK. Writing f = u + tv, we see that 9.3 implies that L?,(X,.A;C) is 
complete. 


9.4 Notions of duality 


Given a normed vector space E, the vector space E’ of continuous linear 
functionals | on E is called the dual of E. For | € E’, we set 


{|| = sup |l(x)| where |z|| <1, re FE. 


It can be shown that E’ is a Banach space. 


9.4.1 Theorem. Let (X,A,1) be a measure space. Then L2(X, A) is a 
Hilbert space when the scalar product is defined by 


(i) i fo = (fla). 


The scalar product for the complez-valued functions L?(X, A, ws) is defined 
by f £9 = (fla). 


ProoF. (f|f) = ||f||72, and Hélder’s inequality becomes 


(it) (fla) < I Fliceligiiz2- 


This is just the Cauchy-Schwarz inequality, which can be proved directly. 
Moreover, L? is complete, and hence is a Hilbert space. 


9.4.2 Corollary. The dual of the space L? can be identified with L?; the 
dual pairing is given by 9.4.1(i). 
PRoor. In a Hilbert space, by Riesz’s theorem! every continuous linear 


functional can be expressed by a scalar product. 


9.4.3 Proposition. Let (X,A, 4) be a@ measure space and let p and q be 
conjugate exponents, 1 < p< +00. Then there is an isometric injection u 
from L4, into (Lf)’. 


See, for example, W. Rudin, Real and Complex Analysis (New York: McGraw- 
Hill, 1974). 
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Proor. Define a mapping u : L4 — (L?,)’ by associating with g € Li, the 
linear functional 


it) = ff 
Then by Holder 


(i) of) < Ighzellfllee = Cllflice, 


which shows both that J, is a continuous linear functional and that u is a 
contraction: 


le(a)IIizey S IIgliza- 


In order to show that u is an isometry,we introduce fy = (signum(g))|g|7/?. 
Then | fol? = Igl*, lIfollz> = Ilgllia, and 


f toa = faces = figl =tallt. 


Hence 
Ig(fo) = |Igllia- 
Furthermore, 
llg(Fo)l < Wollcxey-lNfollee = WMgllczey lglg? 
whence 


Wglleney > llellfa?”. 


But gq — q/p = 1, and hence u is an isometry. 
It follows that u is an injective mapping of L? into (L?)’. 0 


REMARK. It will be shown in Section IV.6 that u is surjective, and thus 
identifies (L?)’ with L? (1 < p < +00). 


9.5 The space L° 


9.5.1 Definition. f € L(x , A) is said to be essentially bounded if there ex- 


ists a bounded representative f of f. The space of essentially bounded mea- 
surable functions is denoted by L7°(X, A). We define Ag.¢ = {x : |g(x)| > 


€} and K(g) = {€ € R™ : w(Ag.¢) = O}. 
If g < Lo, then K(g) 4 0 and we set 


IIglluge = inf K(g). 


9.5.2 Lemma. p1(Ag¢) > 0 if and only if € < Ilgile- 


ProoF. The only case that is not obvious occurs when € = ||g||z20. We then 
apply the continuity theorem for increasing sequences of measurable sets. 
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Setting 9 = Ilglln~ gives B (US Ag eo4n-1) = 0. But Une 1Ag,go4n-? = 


Ag. go: 


9.5.3 Proposition. LX (X,.A,) is a complete normed vector space. 


PROOF. We first prove the triangle inequality for ||-||,.0. Let f,g € Li and 
seth = f+g; then |h(x)| < |f(x)|+|g9(x)| implies that Af ¢,,, > AGeNAG.,). 
Taking complements, we obtain 


H(An.e+n) < L(Af.e) + H(Ag.n); 


or (€ +) € K(h) if€ € K(f) and 7 € K(g). Thus 


I[Pllnce < Wf llne + Ilgllne- 
If |[hl|z~ = 0, then h(x) = 0 a.e. by 9.5.2, and hence || - |[;~ is @ norm. 


Let fn be a Cauchy sequence in the norm || - || ize. Choose representatives f, of 
the class f,, and set Un.nt = fn— fr. Let Anne = {x : |tnm(x)| > 3||Unn’ liz }s 
then, by the definition, (A, n’) = 0. 

Set Z = Un. Anjn’. Then u(Z) = 0 and 


lfn(a) — fr (x)| <3] fx — fai lle if re Z, 


The sequence f, converges uniformly on Z°. Set fo(x) = lim fn(x) if x € Z° and 
fo(x) =O if ¢€ Z. Then fo € LY and || fn — follz — 0. 


9.6 Proposition. Let (X,A,) be a measure space. Suppose that p(X) < 
+oo. Then Lt (X,A)} D> LE (X, A) fl <p<p' < +o. 


PROoF. Use Holder’s inequality to write 


|1.x| 


| if lPdu(x) = / IPP Lxdy(2) <A? lle 
xX 


s 
Lis 


where r and s are conjugate exponents. If p’ < +00, note that 


If? ler = (fine) 


and take r = us > 1. Then 
Pp 


/ 


p'-p 
mee 


(7) flee < [H(X)]*lfllce, where a= 
This shows that every function in LP’ is in L?. If p’ = oo, note that 


i FP <[[fRaou(X).0 


II 


Borel Measures 
and Radon Measures 


Introduction 


The preceding chapter dealt with abstract measure theory; given an ab- 
stract set X, we rather arbitrarily prescribed the o-algebra B of its mea- 
surable subsets. In this chapter, we work in a space X which is locally 
compact and can be written as a countable union of compact sets. A natu- 
ral o-algebra in this context is the Borel algebra Bx. A locally finite Borel 
measure is a measure defined on Bx such that every compact set has finite 
measure. For X metrizable, we prove Lusin’s theorem: If yu is a locally finite 
Borel measure and A € Bx, then for every € > 0 there exist an open set 
O and a closed set F such that F Cc AC O and pu(O — F) < ¢. Thus an 
arbitrary Borel set can be approximated to within e by both an open and 
a closed set. 

A natural vector space on X is the space Cx (X) of continuous functions 
with compact support. A linear functional I on Cx(X) is called positive 
if I(f) > 0 for every nonnegative function f. We prove the Radon-Riesz 
theorem, which constructs a bijection between the positive linear function- 
als on Cx (X) and the locally finite Borel measures. In the Prologue, we 
showed that the Riemann integral on R defines a positive linear functional 
on Cx (R). In this chapter, we apply the Radon-Riesz theorem to obtain a 
canonical translation-invariant Borel measure on R, the Lebesgue measure. 
The theory of the Lebesgue integral appears as a special case of the theory 
of the abstract integral developed in Chapter I. We obtain the Lebesgue 
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integral on R” by constructing the product measure, and prove the change- 
of-variables formula for multiple integrals. 

When Y is compact, the space of continuous functions on Y is a Banach 
space. We consider the dual vector space (C(Y))* of continuous linear func- 
tionals on Y, and show that every linear functional can be written as the 
difference of two positive linear functionals. This leads us to the concept of 
signed Radon measures. 

Given a locally compact space X, we consider the Banach space C;(X) 
of bounded continuous functions on X and the closed subspace Cp(X) of 
functions which vanish at infinity. (Co(X))* is identified with the space 
M'(X) of finite signed Radon measures. Three topologies can be defined 
on this set by using the pairings with Cx (X), Co(X), and C,)(X). We 
compare the three corresponding notions of convergence. 

The first section of this chapter is devoted to the construction of parti- 
tions of unity, which allow the passage from local to global considerations 
on X. It is purely topological, while the rest of the chapter describes mea- 
sure theory on locally compact spaces. 


1 Locally Compact Spaces and Partitions of Unity 


1.0 Definition of locally compact spaces which are countable 
at infinity 
Let X be a Hausdorff topological space which satisfies the following hy- 
potheses: 
1.0.1 X is locally compact, i.e. every point x9 € X has a compact neighbor- 
hood. 
1.0.2 X is countable at infinity, i.e. there exists a sequence {,,} of compact 
subsets of X such that 

KnC Kn4i and (JK, =X. 


n 
1.0.3 Proposition. There exists a sequence H,,, of compact sets such that 
Ay, C Fk (where A denotes the interior of A) 


and = 

(ie Soe 

m=1 
PROOF. The proof is by induction. Set H; = K, and, assuming that H, 
has been constructed, set G, = H, U Ky. Each « € G, has a compact 
neighborhood V(x); from the open cover of Gy formed by {V(2x)}, extract 
a finite subcover. 
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This procedure gives points r4,; € Gg, 1 < j < mg, such that Hy C 


ie} 
Ur<j<m, V(@qj)- Set Ha+1 = Ur<jem, V(ea5)- AS the finite union of 
compact sets, H,41 is compact. Furthermore, 


fe) 
Has 2 U V(%q53) > Ag 


1<jsmq 


UH, > UK, =x.0 


1.1 Urysohn’s lemma 


and 


Lemma. Let F, and F2 be disjoint closed subsets of a locally compact space 
X. Then there exists a continuous function f on X such that 


f(z) =1 if and only if «cE Fi; 
f(z) =0 if and only if x € Fo; 
O< f(x) <1 for all rex. 


PROOF. We restrict the proof to the relatively trivial special case where X 
is a metric space. 
Let 
fi(x) = d(x, F;) = min(d(z,y;)), where y; € Fj. 
Then f; (i = 1,2) is a positive continuous function and fi(z) =0 = we 
ia 
Let a function ® be defined on Z = ([0, +00) x [0, +00)) — (0, 0) by setting 


& 
®(é,7) = —-—. 
(€,7) f+7 
Then ® is continuous since (0,0) is not in the domain of definition Z of ©. 
Furthermore, 


0<@<1, 
B(€,0)=1 if €>0, and 
®(0,n)=0 if n>O0. 


Let f(x) = ®(f1(z), fo(x)). Since F, N Fy = 0, the mapping into (R*)? 
defined by x + (f(x), fo(x)) actually maps into Z. Thus f is the compo- 
sition of continuous mappings and hence is continuous. 0 


1.2 Support of a function 


Definition. Let f be a continuous function on X. The support of f, de- 
noted by supp (f), is the closed set 


supp (f) = closure {x : f(x) # O}. 


1.2.1 Proposition. The following statements are equivalent: 
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(i) 2 € supp (f). 
(i) There exists a neighborhood V(z) such that f(x) =0 Vax € V(z). 
PROOF. Let O = (supp (f))°; then O is an open set and 
{x : f(z) F0}NO C supp(f)NO = 9, 


whence, setting O = V(z), we have shown [that] (ii) > (i). 
Conversely, if V(z) NM {x : f(x) #0} =, then 


V(z)n fe: f(a) #0} = 0.0 


1.2.2 Proposition. Suppose that X is a locally compact space, F is a 
closed subset of X, and O is an open subset of X such that F C O. Then 
there exists a continuous function g such that 


O<g(x) <1 for any rex; 
g(x) =1 if and only if xcxeEF; and 
supp (g) CO. 


PROOF. Set F’ = O°. Applying Urysohn’s lemma (1.1), let f be the function 
associated with the pair of closed sets (F, F’). Set 
" —1 1 


Then F’”’ is a closed set since f is a continuous function. Let g be the function 
associated by Urysohn’s lemma with the pair (F, F’’). Then g(x) > 0 implies 
x ¢ F", or f(x) > 4, which may be written as 


{a : glee) #0} c FCS, I). 


Hence supp (g) C closure (f~*)((4, 1)). 
Since f~'((3, 1]) is closed, we have a fortiori 


supp (9) C #51) CO. 


1.8 Subordinate covers 


1.3.0 Definition. Let {U,} be an open cover of X. An open cover {V,,} is 
said to be subordinate to {U,} if, for any n, there exists a(n) such that 


Vy, Cc Uacn): 
A cover {H,} is said to be locally finite if, for every compact set K, 
card {y: Hy 1K 49} is finite. 


1.3.1 Theorem. Let X be a locally compact space which is countable at 
infinity. Then every open cover has a locally finite subordinate open cover 
{V,,} such that the V,, are compact. 
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PROOF. Let {Ua} be an open cover of X and let {Hm} be the sequence of 
compact sets defined in 1.0.3. Set 


Gi=Hi and Gm = (Hm — Hn-1). 


Then 
Gm = Hm (Hm-1)° C Hm NHS _. 
But 
° c ° ¢ nea 
( Fm) > Hy,-1, whence ( Hm) ai eee 
so that 


Gm CHa n (Bn) ’ 


and thus Gm Hm—1 = 9. Using 1.0.3, 


(i) Gm ial An-2 — Q. 
Set 
(ii) Ua,m = Ua OM Hae nN HE 25: 


Then Ug.m is an open cover of Gm. 
For each x € Gm, there is an open set W(x) such that 


(2i2) W(x) C Ua where a=a(z). 


The W,,,(z) form an open cover of the compact set Gm; from this cover we can 
extract a finite subcover, say Wr,(%1),..., Wm(2j). 

The family {W,,(xx)} is a countable family of open sets, which we denote by 
{Vr}. We have Vn C Ua, where a = a(n). The {Vn} cover Gm for every m, 
hence cover X. For fixed m, (i), (ii), and (iii) imply 


(iv) card {n: Va NGm #O} < +00. 
We now prove a lemma. 
1.3.2 Lemma. Let K be a compact subset of X. Then there exists q such 
{e} 
that K C Hg. 


fe} Cc 
PROOF. Set F, = ( i) 1K; then 9,F, = @. 


The F form a decreasing sequence of closed subsets of the compact set K. 
Since their intersection is empty, there exists q such that 


fe} & 
0=F,=(4,) OK. 
1.3.3 CONCLUSION OF THE PROOF OF THEOREM 1.3.1.Given the compact 
set K, let g be determined by 1.3.2. Then (ii) and (iii) show that 
Wm(t)NK=0 if m>q-2, 


whence 
card {n: V, NK #0} <+00.0 
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1.4 Partitions of unity 


1.4.0 Definition. A partition of unity on the space X is a sequence of 
continuous functions y,, such that 


(i) 0S <1, 
(ii) supp (yp )is compact, 
(iii) card {n: K Nsupp (yn) #0} < +00 for every compact set A, and 


(iv) Syale) = 1. 


REMARK. Condition (iii) is called a local finiteness condition. It implies 
that, for fixed x, the series (iv) contains only finitely many nonzero terms. 
The partition of unity is said to be subordinate to the open cover Uy, if 


(v) for every n, there exists a(n) such that supp (Yn) C Uan)- 


1.4.1 Theorem. Suppose that X is a locally compact space which is count- 
able at infinity and {U.,} is an open cover of X. Then there exists a parti- 
tion of unity subordinate to {U,}. 


ProorF. Let {V,,} be the locally finite cover subordinate to {Ug} con- 
structed in Theorem 1.3.1. 

Since the V, form a cover, by another application of 1.3.1 there is a 
locally finite cover {L,} subordinate to {V,,} which satisfies 


Ls CVn, where n=n(s). 


Applying 1.2.2 to the pair (Ls, Vaca) there is a function g, such that 
supp (gs) C Vn and gs(z) = 1 if x € L,. Since each V, is compact and 
the cover {L,} is locally finite, only finitely many of the elements L, are 
contained in any V,,. Since the cover {V,,} is locally finite, 


card {n: Vz, K #0} < +00 


for any compact set K. Hence, setting I(K) = {s:supp(gs) 1K 4 9}, we 
obtain 
card (I(K)) < +00. 


Thus the sequence {g,} satisfies condition (iii). Set 
D(x) = >> 95(2)- 


To calculate D(x) on a given compact set K’, it suffices to let s range 
over I(K). As this set is finite, D(a) can be written on K as a sum of 
continuous functions; hence D(z) is continuous on K. Together with the 
local compactness of the space X, this implies that D is continuous. 

Furthermore, {L,} covers X. For every x, there exists s such that x € L,; 
that is, 

D(z) >1 forevery xe xX. 


2 Positive Linear Functionals on Cx(X) and Positive Radon Measures 61 


Setting 
gives a continuous function on X. Finally, set y, = Ggn. O 


2 Positive Linear Functionals on Cx (X) 
and Positive Radon Measures 


2.0.1 Notation 


Given a locally compact space X, Cx (X) denotes the vector space of con- 
tinuous functions with compact support. We write 


f>0 if f(x) >0 for every z. 


2.0.2 Definition. A positive linear functional is a linear mapping I : 
Cx(X) — R such that I(f) > 0 for every f > 0. 


2.1 Borel measures 


Let Bx denote the Borel algebra on X. A measure defined on Bx is called 
a Borel measure, and is said to be locally finite if 


2.1.1 p(K) < +00 for every compact set K. 


REMARK. Since K is closed, K € Bx. 


2.1.2 Proposition. Let y be a locally finite Borel measure on X. Then 
every continuous function with compact support is integrable. Setting 


I(t) = f fay 


defines a positive linear functional on CK (X). 


PROOF. Since f is continuous, it is By-measurable. Furthermore, |f| is 
bounded by a constant M, and setting K = supp(f) yields 


|f| < M1x. 


By 2.1.1, lx is integrable; by I-7.4.2, so is f. The positivity of I follows 
from 1-6.8.3. 0 


2.2 Fundamental theorem of Radon-Riesz. Let X be a metrizable lo- 
cally compact space which is countable at infinity. Then the correspondence 


wo ly 
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of 2.1.2 defines a bijection which allows the locally finite Borel measures to 
be identified with the positive linear functionals on CK (X). 


PROOF. This statement contains both an existence and a uniqueness theo- 

rem: Every positive linear functional is represented by an integral with re- 

spect to a locally compact Borel measure, and this representation is unique. 
The proof of Theorem 2.2 occupies the rest of this section. 


2.2.1 Approximation lemma. Let X satisfy the hypotheses of 2.2. Then 
for every open set O in X there is an increasing sequence of compact sets 
K,, such that 


(i) OSU and Kee Kee. 


For every compact set kK in X, there is a decreasing sequence of open 
sets O,, such that O,, is compact, 


(ii) K=N0,. and Oy C On-i. 


PROOF. Set G, = {x : d(x, O°) > +}; then G,, is closed. Let K, = G,NHn, 
where {H,,} is the sequence of compact sets of 1.0.3. 

Then Ky D> Cn a) Hi D Gy-1O Hy~-1, and (i) is satisfied. 

To prove (ii), let m be determined as in 1.3.2 so that K C io and set 


oO 


On Fe {2 :d(a, BK) < =} 


2.3 Proof of uniqueness of the Riesz representation 


Let yz and v be locally finite Borel measures such that 


2.3.0. [ flordute) = f separ), Vf €CK(X). 


2.3.1 Proposition. Suppose that 2.8.0 is satisfied. Then the measures 
pt and v coincide on open sets and on sets which can be written as the 
intersection of an open set and a closed set. 


Proor. Using the approximation lemma 2.2.1(i), we can write O = UK). 
For every pair (Kn, Roti, let gn be determined as in 1.2.2: 


Gn(z) = 1 if re Ky, 
supp (gn) C Kn41, and O0<g, <1. 


Then 1x, < gn < 10, whence 


pK) < / Gigi gO): 
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Since, by I-3.2.1, 
lim u(Kn) = 4(O) < +00, 
it follows that 


(2) tim [andy = u(O) and similarly tim [gud = V(O). 


Since g, € Cx(X), 2.3.0 implies that the left-hand sides of the two equa- 
tions are equal; thus v(O) = u(O) < +o0. 

Let A = F MO, where O is open and F is closed. Using the exhaustion 
principle (I-3.2.4) and setting 


F,=FOH, (Ap defined as in 1.0.3), 


we have 
WF OO) = lim p(F, NO), 
whence it suffices to show that 
w(K OO) =v(K NO) 
for every compact set K. 

By the approximation lemma, 2.2.1(ii), there exists a sequence {O,} of 
open sets with compact closures such that K = lim | O,. Since the O, are 
compact, (O01) < +00; it follows from the principle of decreasing sequences 
(1-3.2.3) that 

w(K OO) = lim w(ON O,), 
n 
and from the first half of the proof that 
b(ONO,) =Vv(ONO,).O 

For convenient reference, we restate the first part of the proof of 2.3.1 in 

a more organized form. 
2.3.2 Constructive definition of (O) 
Let O be an open set in X and let 
T(O) ={f € Ce(X):supp(f) C O and 0 < f < 1}. 
Then 
p(O) = sup | fay where f €T(O). 


PROoF. Set L = [few where f € T(O). Since f € T(O) implies f < lo, 
we have 


if tau <u(O), whence LD < p(O). 


Furthermore, the g, constructed in the proof of 2.3.1 satisfy g, € T(O). 
Thus 


lim f gd <L, whence by 2.3.1(i) w(O) < L.0 
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2.3.3 Terminology 


Subsets of X which can be written as the intersection of an open set and a 
closed set are called sets of type o.c. Open sets and closed sets are special 
cases of o.c. sets. (Take their intersection with X.) A subset of X which can 
be written as a finite union of disjoint o.c. sets is called an elementary set. 
It follows from the additivity of 4 and vy and from 2.3.1 that w(E) = v(E) 
for every elementary set €. 


Lemma. The elementary sets form a Boolean algebra of subsets of X. 
REMARK. Compare I-1.5.4. 
PROOF. 


(i) Let R be an o.c. set. Then R° is an elementary set, for if R = ONF, 
then R° = O° U F° and we can write 


Ro =(OCN FE) U(ONF)U(ONF’). 

The three sets in parentheses are disjoint and each is of type o.c. 
(ii) The intersection of two elementary sets is elementary. Indeed, let 

E =U;R; and €' = U; Rj, where R; = O; N F; and Ri = O; N Fy. 

Then 

EME S Ung Rit Ry 
Since the Ri, Rj are disjoint, so are the R;1 Rj. Moreover, Ri NR), = 
(0; 05) N (Fi 9 Fj) and hence is of type o.c. 


(iii) The complement of an elementary set is an elementary set. If € = 
UR; then €° = Rj. By (i), each Rf is an elementary set. By (ii), E°, 
as the intersection of finitely many elementary sets, is elementary. 


(iv) X is of type o.c. (hence elementary). 


(v) A finite union of elementary sets is elementary. By (iii), it suffices 
to prove the statement for complements of elementary sets; but this 
follows from (ii). 


2.3.4 Proof of the Radon-Riesz theorem (uniqueness) 


Proor. Let B= {A € Bx : (AN H,) = u(AN An) Vn} (where H,, was defined 
in 1.0.3). 

We first show that B is a monotone class. This is immediate for increasing 
sequences, by I-3.2.1. Now let {As} be a decreasing sequence, A, € B. Then, by 
the compactness of Hy, 


(As Hn) <+o00 and v(AsM Hn) < +00. 
Applying 1-3.2.3, 


lim p(A, 9 An) = w((lim | As) A), 
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whence 
w((lim | As) Hn) = v(lim(| As) NM An). 


As this is true for any n, we have (lim | A.) € B. 

Furthermore, B contains the Boolean algebra of the elementary sets of X by 
2.3.1. Therefore, by I-1.4, B coincides with the o-algebra generated by the open 
sets and the closed sets; that is, B= Bx. 0 


2.4 Proof of existence of the Riesz representation 


Given a positive linear functional I on Cx (X), we would like to represent 
it in integral form. We begin by using a construction that appeared in the 
proof of uniqueness. 


2.4.1 Measure of open sets 


As in 2.3.2, we set 

T(O) = {f €Cx(X):supp(f) CO and O<f <1}. 
Given a positive linear functional J, we define 
(1) I(O) =supI(f), where f €T(O). 


I(O) is called the measure of the open set O relative to the linear form I. 
Note that 


(ii) TOVSAO) AE Oveoy 


(iii) Proposition (Convexity inequality). Let {O,} be a sequence of 
open subsets of X. Then 


i (U 0.) < 5° 1(On). 
PRooF. Set W = U,On. Let f € T(W) and set 


Q = (supp (f))°*. 


Then 2,{O,} form an open cover of X. Let y, be a partition of unity 
subordinate to this cover. Set 


fa = fq: 


Let S = {q: gf # 0}. Since f has compact support, card(S) < +00. If 
q€S, let q+» @(q) be a mapping from S to N such that 


supp (Pq) C Oa(q)- 
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Set 
S(n) =07!(n). 


The nonempty S(n) form a partition of S. Set 
Vn = Ya: 
qe S(n) 


Set J 
(fn 


A(S) and f, = fun. Then f < ><, fn and, since fy, € T(On), 
(On) and 


1A | 
by, 


I(f) < $2 1(On) < S> 1(O,).0 


ned 


(iv) Proposition (Additivity of /). Let O; be a sequence of disjoint 
open sets, and set O = UO;. Then 


1(0) = $9 1(0;). 


PrRoor. Given n and e, consider the nth partial sum of the series on the 
right-hand side and choose f; € T(O;) such that 


I(fi) > I(O;) = e27*, 
Then f = )7'_, fi satisfies f € T(O), whence 
KO) = 1) = Soh) = D0.) -€ 
i=l i=l 


Since n and € are arbitrary, we obtain 


which together with the convexity inequality gives (iv). 0 
2.4.2 Measure of compact sets 
Let K be a compact subset of X and set 

i(k) =infI(O), Oopen, OD K. 


Then 


(i) Ay C Keimplies!(K,) < I(K2); and 
(ii) if AK is compact, O is open, and K Cc O, then I(K) < I(O). 
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(iii) Proposition (Finite additivity). Let K,, Kz, ..., Ky be a finite 
collection of compact disjoint sets. Then 


I (U «:) = SK). 
i=1 i=1 


PROOF. Let 2€ denote the infimum (minimum) of the distances from K; to K; 
and let 
U; = {x : d(x, Kj) < e}. 


Then the U; are disjoint open sets. 
Choose O; such that O; > K; and I(O;) < I(Ki)—¢27', and set Of = U;NO;. 
Let K = UK, and choose O such that I(K) > I(O) — «. Set Of = ONO}; 
then K Cc UO! Cc O, which implies that 


I (U 0) SS KY <I (Uo!) 


Since the Of are disjoint, 2.4.1(iv) implies 


I (U 07) =~ 1(04). 


Since 
Kj Cc O; Cc O;, 
we have ; 
I(O4) — 277 < I(K;) < 1(0%), 
and thus 


DE HOF) ~€< DIT) < Y7107).0 


2.4.3 Inner measure and outer measure 


We would like to define set functions for arbitrary subsets A of X. We set 


p*(A) = inf I(O), Oopen, ODA, and 
ptx(A) = supI(K), K compact, K CA. 


Then by 2.4.2(ii) 
(i) tx(A) < (A). 


[4(A) is called the inner measure of A and ju* its outer measure. 


(ii) Proposition (Convexity inequality for y*). Let {A,,} be a sequence 
of subsets of X. Then 


ue (Ua) < So ut (Ai)- 
i i=l 
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PRoorF. Choose a sequence of open sets {O;} such that 
A; CO; and I(O;) < p*(A;)+¢€ 27°. 


Let A = U;A; and let O = UO;; then A C O, whence p*(A) < I(O). 
By 2.4.1 (iii), 


1(0) < So 1(0;) < Sou (Ai) +e. 


(iii) Proposition (Concavity inequality for p.). Let {A;} be a se- 
quence of disjoint subsets of X. Then 


+00 +00 


PROOF. Consider the nth partial sum of the series on the right-hand side. 
Fix compact sets K; such that 


K,C A; and I(K;) > p.(Ai) — € 27. 


Let K = U?_, Kj; then K is compact. Since the A; are disjoint, so are 
the K;, and finite additivity (2.4.2(iii)) gives 


I(K) = 01K) 2 oA) = 


Since AK is compact and K Cc A =U;Aj, we conclude that u.(A) > I(K). 
a) 


2.4.4 Construction of the measure (compact case) 


Throughout this section, we assume that 


(H) X is compact. 


(2) B= {Ae P(X): p(A) = us(A)}. 
If A € B, we set 
(ii) H(A) = p"(A) = px (A). 


(iii) Proposition. A € B if and only if for every € > 0 there exist a 
compact set K and an open set O such that 


KCACO with I(O)—¢<I(K) <1(O). 
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PROOF. We prove sufficiency; the proof of necessity is similar. If A € B, 
there exists a compact set K such that K C A and I(K) + § > p.(A). 

There exists an open set O such that O > K and p*(A) > I(O) — §. 
Hence the fact that y.(A) = u*(A) implies that 


1(0) -e< I(K) < 1(O).0 


(iv) Proposition. Every closed set is in B. 


Proor. Let K be closed (hence compact). Then p,(K) = I(i) by defini- 
tion, and 


w(K) = inf 1(0) = 1(K) 


by definition of I[(k). 0 
(v) Proposition. Every open set is in B. 


PROOF. Let O be an open set. Formally, u*(O) = I(O). 

Furthermore, given « > 0, by the definition of [(O) there exists g € T(O) 
such that I(g) > I(O) - 

Let K be the support of g. Then g € T(Q2) for every open set 2 D K. 
Hence I(g) < [(Q) VQ D K; that is, 


I(g) < inf 1(Q) = I(K). 


Thus 
I(K) > I(g) 2 1(O) - 


and therefore 


Hx(O) 2 w*(O) ~ 6.0 
(vi) Proposition. Let {A,} be a sequence of disjoint elements of B. Then 


UnAn €B and u(UnAn) = DH(An). 


PROoF. p* (UAn) < 30, w(An) by the convexity inequality, and yu. (UA,) > 
yen H(An) a the concavity inequality. 
Setting Z = UAn, we thus have y.(Z) > w*(Z), whence Z € B and 


diMAn ) S pe(Z) = w(Z) = w"(Z) < So (A, ).0 


We now refine criterion (iii). 


(vii) Lusin’s criterion. Let A € P(X). Then A © B if and only if for 
every € > 0 there exist a compact set K and an open set O such that 


KCACO and p(O-K) <e. 
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PROOF. By (iii), we can find K Cc A C O such that 
pw(O) < pK) +. 


But (O — A) and K are disjoint and belong to B (because (O — Kx) is 
open and Kk is closed), whence by (vi) 


m(O— K) +p) = w(O), or p(O- K) = n(O) — p(k) < 60 


(viii) Proposition. B is a Boolean algebra. 


Proor. We will use Lusin’s criterion (vii). We first show that A‘° € B if 
A € B. There exist a compact set A’ and an open set O such that 


KCACO with p(O-K)<e. 
Then 


Of C ATC K® and K°-O*° =O-—K, whence 
Bb(KS — O°) = w(O- K) <e. 


Similarly, let A, A’ € B; then KU AK’ C AUA’ COUO’ and 


(OUO)N(K UK’) (ON(KUK'))U(O'N (KU R‘)*) 


Cc (ON K)U(O'NK’),. 
Hence, by the convexity inequality for the outer measure, 
w((OUO)A(K UK’) < w(O-— K) + (0! — KB’). 


Since all the sets in this expression are in 6, we can replace y* by ju to 
obtain that AU A’ satisfies (vii); hence AU A’ € B. 


(ix) Theorem. Suppose that X is a compact space and B is the family of 
sets defined in (1). Then B is a o-algebra containing the Borel algebra and 
pt defined in (ii) is a measure on B. The o-algebra B is -complete. 


ProoF. It must be shown that a countable union of sets A, € B is in B. 
Set 
a te: 
Bi, =A, Bn =AnN (UBT Aj) . 


Then UB,, = UA, and, since B is a Boolean algebra, B, € B. 

Since the B, are disjoint, it follows from (vi) that their union is in B. 
Thus B is a o-algebra. By (vi), is a measure on B. By (iv), B contains 
the closed sets; therefore B contains the Borel algebra Bx. Next, let 


Y CA, where AEB and p(A)=0. 


Then 
w(Y) < (A) =0. 
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Furthermore, by 2.4.3(i), 


b(Y) < w"(Y), 
whence 
y(¥) = u"(¥) = 0. 
Thus Y € B, and hence B is p-complete. O 


(x) Definition. The measure z constructed in Theorem 2.4.4(ix) is called 
the Radon measure associated with the positive linear functional J. The 
o-algebra B on which the Radon measure y is defined contains the Borel 
algebra Bx. By restricting u to Bx, we can associate a Borel measure p’ 
with yp. The a-algebra B is the completion of Bx with respect to the measure 
’; this will be proved in 3.4.2. 


2.4.5 Proof of the representation theorem (compact case) 


Theorem. Let X be a compact space, let I be a positive linear functional 
on C(X), and let w be the Borel measure associated with I by 2.4.4(ix) and 
(x). Then 


jf fu = 18) 


PROOF. We will show that 
(2) (fy < [fav forevery f €C(X). 
For a given € > 0, let 
Ap = f~'({ke,(k+1)e)), where |k| < N, 


with N chosen so that M = max|f| < Ne. Set 


or (((b- ses) 


Then Of is open since f is continuous. NO? = Ax, and hence the theorem 
on decreasing sequences gives 


lim (Of) = w(Ag). 
Fix n so large that 


(ii) de (k +1)lM(OR) ~ w(Ae)] <1. 


[RIS N 


Since the Ax form a partition of X, the O? form an open cover of X. Let 
yx be a partition of unity subordinate to this cover. Set f, = yx f; then 
f = >° fe and moreover fy < (K+ l)epp, whence I(fr) < (kK + lel (yx). 
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Since 0 < yw, <1 and supp (yx) C Of, we have I(yx) < uw(Of%), whence 


I(f) = Sofie) < Sok + Dew(OP). 


Using (ii), 
LS Sok + l)epw(Ax) + €. 


frw=d fis 


But f(z) > ke if « € Ax, whence 


[ tau > So kep( Ar) 


Furthermore, 


and therefore 
I(f)< [fare (: +E nlae) 
k 
Since Ax is a partition of X, S> u(A,) = u(X). Thus 


I(f) < / fay + e(1+u(X)). 


As € is arbitrarily small, we have proved (i). 
Now, applying (i) to f’ = —f, we obtain the opposite inequality to (i): 
the two inequalities imply equality. 


2.4.6 Proof of the Radon-Riesz theorem (noncompact case) 


Let X be a locally compact space which is countable at infinity. Let {H,,} 
be the exhaustion sequence constructed in 1.0.3 and let u,, be the function 
associated by Urysohn’s lemma with the pair (H»~1, (4,)). 


(i) Lemma. Let C(H,,,) denote the functions defined and continuous on 
H,,. For f € C(Hm), define tm.f by 


(umf )(x) = f(x)Um(2) if ce Ay; 
(Um-f)(x) =0 if x ¢ Hn. 


Then um.f is a continuous function on X. 


Proor. Only the behavior at the boundary of H,,, must be checked. Let 
xo be a point in the boundary of H,,; then w,»(xvo) = 0 and there exists a 
neighborhood U of zg such that |um(x)| < € if « € U. Hence 


(Um-f)(#)| < €max |f|.0 
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(ii) Corollary. Let I be a positive linear functional on Cx(X). Set 
Im(f) = I(um-f), f € C(Hm). 


Then Im, is a positive linear functional on Hy. 


(iii) By the compact case of Riesz’s theorem, proved in 2.4.5, there exists 
a measure j,, defined on the Borel algebra By,, of Hm such that 


Oe i fdtim, Vf €C(Hm). 


(iv) Let f € Cx(X); then there exists p such that 


supp (f) C Hy. 


Hence u,,.f = f ifm > pand I(um.f) =I(f), and thus 


/ fdjim =I(f) if m>p. 


(v) Let O be an open subset of X such that O is compact; then there exists 


p such that O C Hy. 
Hence, letting 


T(O) ={f € Ck(X) : supp (f) C O}, 


we have 
Um(O) =supIm(f), where f €T(O). 
By (iv), 
Im(f) =I(f) if m>p, 
whence 


Um(O) =supI(f) = Um (O) if mand m’ > p. 
(vi) The measures fm and Lm coincide on the Borel algebra By, if m, 
m’ > p. 


PROOF. Let 
Z = {A€ Bu, = bm(A) = bm (A)}- 


Let Og be a decreasing sequence of open sets such that 
NO,=Hp and O,C Hoss. 
Then pim(Og) = Um’ (Oq) by (v). Hence 


Um(Hp) = Um! (Hp). 
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Let B € Z; then, since B® = H, — B, 
Hm (BS) = bm(Hp) ~ bm(B) = bm! (Hp) — bm (B) = Um (B*). 


Hence B € Z implies B° € Z. 
ie} 
Let G be an open subset of H,. Then there exists G’ C H,41 such that G’ is 
open in X and 
G’ OH, =G, 
whence 


G =limG'N Oy. 


By (vy), 
Pan(G’ Og) = tm (GM Og). 


Hence Z contains the open subsets of H,. Taking complements shows that Z 
contains the closed subsets. 

We now use 2.4.4(vii) (Lusin’s criterion) and 2.4.4(ix). Given a Borel set A 
and an « > 0, there exist a closed set K and an open set O such that K C ACO 
and [im(O) < pim(K) +. 

Since fim(O) = fm/(O) and fim (K) = Um (K), it follows that 


Lm’ (BK) < bm! (A) < Lm? (O) = Lm(O) < bm (K) ai €, 
Hm(K) < Um(A) < Um(O) < Um (K) + €. 


Hence 
[Lm (A) am Hm’ (A) <e. 
Since ¢ is arbitrarily small, fim(A) = Um’ (A). 


(vii) Definition of Borel measure. 
Let {H,,,} be the exhaustion sequence defined in 1.0.3. For A € Bx, set 


(A) = lim tm42(AN Am). 
By (vi), 
Hem+2(A ‘a Hy»-1) ce Um+1(A al Hm-1); 


whence the inclusion AN H,»-; C AM Hm implies that the sequence 
{Um+2(AM Hy,)} is increasing. Hence its limit exists and is finite or equal 
to +00. 

We first prove finite additivity. Let A;, Ag € By, Ay M Ag = 0. Then, 
setting A = A; U Ao, 


Lm42(A a Hy») = bem+2(A4 a H,,) ae bm+2(Aa N A). 
Hence, passing to the limit, 


u(A) = (Ar) + p(A2). 


To prove o-additivity, it suffices to show that 4 is continuous on increas- 
ing sequences. 
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Let B; C BoC... C By, ..., where By € Bx, and set Ba = UBy. 
Suppose first that (Boo) = +oo. Let M be a positive real number; then there 
exists m such that 
(Boo Hin) > M. 


By (vi), 
U(Boo 1 Am) = Um+2(Boo N Hm). 


Since fim+2 is continuous on increasing sequences, there exists q such that 
}m+2(Bq nN Hm) > M, 
whence 
u( Bag) > M. 


As this is true for all M, lim (Bz) = +00. 
We now consider the case u(B..) = a < +00. Let € > 0 be given. There exists 
m such that 
a—€<p(BoMHm) <a. 


By (vi), u( Bo Hm) = Um+2(BooN Hm). Since fim+2 is continuous on increasing 
sequences, we have 


lim p2m+2(Bg MN Hm) = Um+2(Boo M Hm). 
q 


Hence there exists r such that 


bm+2(BrO Hm) > Um+2(BooN Hm) — €, 


and thus u(B,) > u( By Hm) implies that u(B-) > u( Bo) — 2e. 


(viii) Representation formula. 
Let f € Cx(X); then there exists m such that supp (f) C He By (iv), 


I(f) = / fees 


Am 


But dum+2 is equal to du on H,»,, whence 


1H) = f Fay. 


(ix) Definition of the associated Radon measure. 
Completing the measure space (X, Bx, ) yields a measure 7, called the 
Radon measure associated with the linear functional J. 


3 Regularity of Borel Measures 
and Lusin’s Theorem 


3.0.1 Hypothesis. We assume that the space X is locally compact, metriz- 
able, and countable at infinity. 
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3.0.2 Definition. A measure y defined on a o-algebra B containing the 
Borel algebra Bx of X is called regular if for every A € B and for every 
€ > 0 there exist an open set O and a closed set F such that F C ACO 
and p(O— F) <e. 


3.1 Proposition. Let X satisfy 3.0.1 and let p be a locally finite Borel 
measure on Bx. Then there exists a Radon measure v such that p(A) = 
v(A) for every A € Bx. 


Proor. Let f € Cx(X). Since the indicator function of any compact 
set is integrable, the inequality |f| < M1lx, where K = supp(f) and 
M =max|f|, implies that f is integrable (see I-7.4.3). 

Hence a positive linear functional can be defined on Cx (X) by setting 


rt) = f fap. 


By the uniqueness theorem (2.3.4), the linear functional I determines 
the measure; that is, if v denotes the Radon measure associated with the 
form I by Riesz’s theorem, then 


p(A)=v(A) forany Ac By.O 


3.2 Theorem. Let X be a locally compact space satisfying the hypothesis 
of 3.0.1. Then every Radon measure js on X is regular. 


Proor. If X is compact, regularity follows from Lusin’s criterion, 2.4.4(vii). 

If X is only locally compact, let A be a measurable subset of X and let H,, 
be the exhaustion of the space constructed in 1.0.3. Set A, = G,NA, where 
Gn = (Hn — Hn-1). Using Lusin’s criterion on the compact set Hy4+1, fix 
a closed set F;, and an open set O, of X such that 


Fy C Anyi, FnrCAnCOn, and p(O, - F,) < €27”. 


Note that F,, is compact. Set O = UO,; then O is open and O D A. 
Similarly, set F = UF,,. By 1.3.2, this union is locally finite (that is, any 
compact set meets only a finite number of F;,); hence F is closed. Clearly 
FcAand p(O-F)<e.0 


3.3 Theorem. Let X satisfy the hypothesis of 3.0.1. Then any locally finite 
Borel measure p on X is regular. 


PROOF. By 3.1, p is the restriction to the Borel algebra of a Radon measure 
y. Since v is regular by 3.2, a fortiori so is p. 0 


3.4 The classes Gs(X) and F,(X) 


3.4.0 Definition. The class of subsets of X which can be written as a 
countable intersection of open sets is called Gs(X). 
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A countable intersection of elements of Gs(X) is in Gs(X). 


The class of subsets of X which can be written as a countable union of 
closed sets is called F(X). 


A countable union of elements of F,(X) is in F(X). 
Clearly Gs(X) and F(X) are subclasses of the Borel algebra. 


3.4.1 Proposition. Let be a regular measure defined on the o-algebra B 
of the locally compact space X. Then for every A € B there ezist 


TEGs(X) and ®EF,(X) 


such that 
®CACT and p(T—) =0. 


PRooF. By 3.0.2, we can find a sequence {F’ } of closed sets and a sequence 
{O/,} of open sets such that 


FiCACO! and p(OL-Fi)<nl. 


Set 


On =Ng<nO7 


and Fy, = UgenF). 
Then F,, Cc A Cc O, and {O,, — F,,} is a decreasing sequence. Furthermore, 
On — Fr C Of, — Fi), whence 0 = lim (Opn — Fr) = lim u(| (On — Fy)). Set 
[T=lim | O, and ® = lim fT Fy. 

Then [ — ® = lim | (O, — F,), whence u(T — &) = 0. Finally, 


lim | On €Gs(X) and lim 7 F, € F,(X).0 


3.4.2 Corollary. Let 4 be a regular measure defined on a a-algebra B on 
X, let ys’ be the restriction of 4 to the Borel algebra Bx, and let fi’ denote 
the measure defined by extending 1’ to the completion Bx. Then Bx D B 
and 1 equals the restriction of fi’ to B. 


REMARK. Cf. I-4.2.2. 


3.4.3 Lusin’s theorem. Suppose that X is a locally compact space, v is a 
regular measure defined on the o-algebra B > Bx, and f is a B-measurable 
function. Then for every compact set H and every € > 0 there exists a 
compact set K such that K C H, v(H — K) <e«, and the restriction of f 
to K is continuous. 


PROOF. Set G, = {x : |f(x)| > n}NH. Then {G,,} is a decreasing sequence 
and v(G,) < +oo, whence 


lim (Gn) = v(NG») = 0. 


Hence we can find m such that v(Gn) < 271e. 
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Considering f’ = flg,, reduces the proof to the case of bounded f. In 
this case, f is the uniform limit of a sequence {g,,} of simple functions (cf. 
1-6.4.1, of which we follow the notation). Let mo be such that |f(x)| < mo 
for all x. Setting 

Jin = {0 © H: f(x) € [hn7}, (+ In}, 


we may take 


In = She Lage where —nmo <k < nmpo. 
k 


Using the regularity of v, we can find a compact set Ay. such that 


Kren C Jin and Sou dion es Kren) <S Qorrte, 
k 


Let V,, = UpKe.n, where |k| < nmo. 

Then V,, is a finite union of compact sets and hence compact. Further- 
more, V(H NV,°) < 2-"~1e, Let W = UnVN A. 

The convexity inequality (1-3.3) gives 


(2) vW)<e and W°NH =D?V,. 


Set Vs, = NV,. Then V,, is compact, whence 


(ii) Kg = Vg Kyo is compact: 
Moreover, 
(tit) Kypn is openin Vy, 


since Kg, VVn = Uj¢rKj.ns | < nmo. 
By (iii), there exists an open subset 2 of X such that QNV, = Ky,, and 
hence Kj, ,, = 2M Voo; it follows that 


(iv) Ky, is openin Vo. 
It follows from (ii) and (iv) that the indicator function of Kj, ,, is contin- 
uous on V,,. This, with the fact that Jpn A Vso = Ken 1 Vo, gives 


(uv) The restriction of gn to Vo. is continuous. Since g,, converges uniformly 
on V,, to f, the restriction of f to Vx is continuous. 
Furthermore, (i) shows that v(H — Vx.) <¢. 0 


3.5 Density theorem. Let X be a locally compact space satisfying the 
hypothesis 3.0.1 and let v be a Radon measure on X. Then for every p, 
1<p< _+oo0, Cx(X) is dense in L?(X,v). 
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PROOF. Let {H,} be the exhaustion of X defined in 1.0.3 and let T, be the 
truncation operator, defined in I-6.7, associated with this exhaustion. Let f € L? 
be given. Then 
(Tn)(f)(z) — f(x) for every rE xX 

and 

Taf — fi? < flP. 

By the dominated convergence theorem (I-7.6), 

ITnf — fllze > 0. 
Let m be such that 

|Tmf — f\lne <e. 
Set fm = Imf; then fm is bounded by m and its support is contained in the 
compact set Hm. Set 7 = (m71e)”. 

Let K be a compact set, depending on m, such that the restriction gm of fm 
to K is continuous and such that v(Hm — K) < 3. Let O be an open set such 
that OD Hm and v(O — Hm) < #. 

By a theorem of Urysohn,! we can find u € Cx(X) such that supp (u) C O, 

u(z)=Ym(x) if «eK, and u(x) <m_ forall cz. 

On K, fm = Ym = u, whence fm —u= (fm —u)1lxKelo. Since |fm — ul < 2m, 

Ilfm — ulltn < (2m)’X(ON K®) < (2m)?(m~*e)?. 
Hence || fm — ullze < 2e, and finally ||f — ullze < 3e. 0 
3.6 REMARK. The regularity of Radon measures allows us to approximate 
L? functions by continuous functions, and measurable sets by open or closed 
sets. 


4 'The Lebesgue Integral on R and on R” 


4.1 Definition of the Lebesgue integral on R 


We first consider Cx(R), the continuous functions on R with compact 
support. The Riemann integral (see the Prologue) defines a positive linear 
functional on Cx(R) by 


rf) = f oat 
Hence there exists by II-2 a Radon measure yz such that 
1(t)= f F)dutt) 


This yu is called the Lebesgue measure on R, and functions measurable 
in this sense are called Lebesgue measurable. 


‘See, for example, N. Bourbaki, General Topology (New York: Springer-Verlag, 
1989), IX.4.2. 
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4.2 Properties of the Lebesgue integral 


We include here only properties specific to the Lebesgue integral. Its most 
important properties are common to all Radon measures, and were estab- 
lished in Sections 2 and 3 of this chapter. 


4.2.1 Proposition. Let a, b€ R, a < b. Then 
u([a, b]) =, (a, b)) =b—a. 
PROOF. p((a,b)) = supI(f) where 0 < f < 1 and supp (f) C (a, 0). Setting 


f=1 on [a+ 2e,b— 2¢], 
f=0 if t<a+t+e or t>b-e, 


and f linear on [a+ €,a + 2e] and [b — 2€,b — €], we obtain 
u((a,b)) > b-a—3e. 
Hence, since € is arbitrary, 
u((a,b)) 2b-a. 


The opposite inequality follows from the mean value theorem for the Rie- 
mann integral. 0 


4.2.2 Theorem. Let O be an open subset of R. Then O is a countable 
union of disjoint intervals: 


(2) O 
(21) WO) = So (be — a). 


PROOF. Let x € O and set 


Un(ax, bk); and 


a(x) =sup{y:y <a, y ¢ O}, 

B(x) =inf{y:y > x, y ¢ O}. 
Since O% is closed, a(x) € O% if a(x) is finite and A(x) € OF if G(x) 
is finite. It follows that (a(a), 6(a)) C O and that there exists no open 


interval which strictly contains (a(x), 6(x#)) and is itself contained in O. 
Moreover, x € (a(x), 3(x)), whence 


O= lala), a(@)). 


LEO 


Define an equivalence relation on O by 


eve’ if (a(x), 8(x)) = (a(2’), A(2')). 
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Since every open interval in R contains at least one rational number, the 
set of equivalence classes is countable and (i) follows. We obtain (ii) by 
using the o-additivity of uw and 4.2.1. 0 


4.2.3 Corollary. Every open set has strictly positive Lebesgue measure. 


4.2.4 Theorem (Characterization of negligible sets). A subset E of 
R is negligible with respect to Lebesgue measure if and only if, for every 
€ > 0, there exists a sequence of intervals (cy, dx) such that 


(ck, de) DE and So (de = Cr) <e. 
k 


PROOF. The sufficiency of the condition follows from 4.2.1 and the con- 
vexity inequality (1.3.3). Its necessity follows from the regularity of Radon 
measures (3.2) and from 4.2.2. 0 


4.2.5 Corollary. Let x € R and let A = {x}. Then p(A) = 0. 
PRooF. O, = (z — 4,2 + +) satisfies u(On) < 2n7!. 0 


REMARK. We can summarize 4.2.5 by saying that a “point” of R has 
Lebesgue measure zero. 


4.2.6 Translation invariance 


For fixed a € R, translation by the vector a is the mapping 7, of R into R 
defined by 

Tai: LO U+a. 
Proposition. Let B be a Lebesgue-measurable subset of R. Then T,(B) is 
Lebesgue measurable and p(T,(B)) = u(B). 


PrRooF. It follows from the definition of the integral J in 4.1 that I(ta(f)) = 
I(f), where (ta f)(x) = f(x — a). The uniqueness of the Radon measure 
associated with a positive linear functional implies the result. O 


4.2.7 Notation 


By abuse of language, we write 


[ stoae for [ sau. 


We thus use the same notation for the Riemann integral and the Lebesgue 
integral that extends it. Translation invariance is written 


(i) I f(t —a)dt = I, f(t)dt. 


The vector space of Lebesgue-integrable functions defined on R will be 
denoted by L1(R). The next statement follows from the translation invari- 
ance of Lebesgue measure. 
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(ii) If f € L?(R), then taf € L?(R) and 


IItofller = llfllan, 1S p< +00. 


4.3 Lebesgue measure on R” 
4.3.1 Definitions and notation 


To simplify notation, we begin by constructing Lebesgue measure on R?. 
We denote by (R.A, 4) the real numbers equipped with Lebesgue mea- 
sure y and the o-algebra of Lebesgue-measurable subsets. Let (R, A, 111) 
and (R, A, 2) be two copies of the measure space (R, A, i). 
Let R? = R x R and let B denote the tensor product o-algebra : 


B= A, ® Ap. 


Then B contains the Borel algebra of R? (I-2.4.2). Let ju; @ jug be the 
product measure defined on B by I-8.4.1. 

Lebesgue measure on R? is the measure v obtained by completing [41 ® p12 
(cf. [-4.2.3). The completion of B is the o-algebra of Lebesgue-measurable 
subsets of R*. We denote by L!(R”) the space of Lebesgue-integrable func- 
tions on R?. 

If f €¢ L'(R*), we write 


is sav= ff f(ti.te)dty dtp. 


Then, by Fubini’s theorem (I-8.5), 


i f (ti, tz) dtidt2 = I dtz if F(t ta) | ; 


Lebesgue measure on R” is constructed recursively, by writing R” = R x 
R"~'. For f € L'(R”), the integral thus obtained is written as 


[Flt tees taldtdta dln 
R” 


and Fubini’s theorem reduces the calculation of this integral to the calcu- 
lation of n successive integrals on R. 


4.3.2 Lebesgue measure on R” and the Radon-Riesz theorem 


To simplify notation, we restrict to the case where n = 2. Let a positive 
linear functional be defined on Cx, (R”) by 


(i) I(y) = ie | Atta) , geCx(R’). 
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By the Radon-Riesz theorem, there exists a Radon measure p such that 


I(y) = i gdp. 
By the uniqueness part of the Riesz representation theorem, 
p(A) =v(A) — for every Borel set A. 
Furthermore, since 
v([—R, +R] x [-R,+R]) = 4R?, 


Lebesgue measure is locally finite and hence regular by 3.2. 
The measures p and v are complete regular measures defined on the Borel 
algebra. 


Lebesgue measure on R? may be regarded as the Radon measure associated 
with (i). 
4.3.3 Translation invariance 
This is proved as in 4.2.6, by using 4.3.2. 


4.3.4 Proposition. Every open set in R” has strictly positive Lebesgue 
measure. 


PROOF. We restrict to the case where n = 2. Let O be a nonempty open 
set and let (t1,t2) € O. Then there exists € > 0 such that 


Q = (ti —6,ti + ©) x (tg -e,t2 +6) CO. 


The product measure of the square @ is the product of the measures of its 
components (I-8.1(i)), whence 


v(Q) =4e2 >0 and v(O) >v(Q).0 


4.4. Change of variables in the Lebesgue integral on R” 


4.4.0 Some facts from differential calculus 


Let O be an open set in R". A mapping f = (f!,...,f”) is said to be a 
diffeomorphism if 


(i) f(O) is an open subset O’ of R” and f is a homeomorphism of 
O onto O’ (i.e. a bicontinuous bijection); and 
(ii) f and g have continuous first partial derivatives, where g de- 
notes the inverse homeomorphism. The Jacobian matriz of f is 
the matrix 
of? 


= Ox,’ 


Jy l<i<n,1<k<n. 
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We then have the following composition law: 

If f and h are diffeomorphisms for which the composition ho f = q is 
defined, then q is a diffeomorphism and the Jacobian matrix of q is the 
product of the Jacobian matrices, 


(iii) Iq = InJp- 


In particular, Jy = J; . 

Thus the Jacobian matrix of a diffeomorphism is invertible: det(J;(x)) 
is a continuous function that is nowhere zero, and hence has constant sign 
on a connected component of O. 


4.4.1 Change-of-variables theorem 


Theorem. Let O and O' be open subsets of R" and let f be a diffeomor- 
phism from O onto O'. 

Let Cx (O') denote the continuous functions which have compact support 
contained in O'. Then 


(i) : p(f(x))|det Jp()|dx = / yla!)dn’ if peCk(O’). 


Oo’ 


REMARKS. 


(ii) Since f is a homeomorphism, y € Cx(O’) implies (yo f) € 
C(O). Since det(Jy(x)) is a continuous function, the inte- 
grands on both sides of (i) are continuous functions with com- 
pact support and therefore integrable. 

(iii) Using a partition of unity on O’, we can write y = >> ys, where 
the y, are supported in arbitrarily small open sets. It thus suf- 
fices to prove the theorem for each y,. This means that we may 
assume throughout that y has sufficiently small support. 

(iv) Functoriality. Suppose that f = g oh, where g and h are dif- 
feomorphisms. If the change-of-variables formula is proved for 
the diffeomorphisms g and h, then the result will hold for f in 
view of the identity 


|det Jy| = |det J,| |det J;,|. 


4.4.2 Lemma. The change-of-variables formula holds when n = 1. 


PROOF. In this case, the formula becomes 


[eveonirela =f oleae’ 


Using (iii), we can reduce the proof to the case where the support of y is 
small enough that f’(x) has constant sign. By the mapping xv — —2z, this 
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can be further reduced to the case f’(x) > 0. Then the formula is 


[ere@s'woar = [ oteae’ 


Set 


F(t) i vl F(a)) f"(e)ae, 


f(t) 
/ y(2' da’. 
f(0) 


Then, differentiating the integrals, we obtain 
G'(t) = e(fO)F'(), 
F'(t) = p(f®)F'(O- 


Hence F(t) — G(t) is a constant. 
Setting ¢ = 0 shows that this constant is zero. 0 


G(t) 


4.4.3 Proof of the change-of-variables theorem 


We proceed by induction on n. Assume that the result holds for m < n. 
Writing z € R” in the form x = (£€,y), where ER, ye R”1, set 


h(x) = (€',y'), where €’= f'(£,y), y'=y, and 
g(x!) = (€,0(2")), where 6 = (f(E,y'), --., #"(E,¥’))- 
The notation PE ,y) means that € has been replaced in this expression by 
é’, by inverting the relation ¢’ = f'(€,y). 


By the implicit function theorem, this inversion is possible in a neigh- 
borhood of 2x9 if 


(i) J (ao) £0. 


But the fact that det Jy # 0 implies that the column vector (35) a 
l<k<n 
is nonzero, and we can renumber the coordinates so that (ii) holds. Thus 


g(x’) can be defined, and it follows from (i) that 
f=goh. 


Using 4.4.1(iv), it suffices to prove the theorem for g and for h. Next, we 
calculate 


[PE Oe det J) dea 
RxR”-! 


By Fubini’s theorem, this equals 


(iti) [ee [eeu lace Jy)av 
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Note that the Jacobian matrix J, has some row for which all entries are 
zero except the diagonal entry, which equals 1. 

Thus det J, = det Jo,,, where Og: y’ — O(E',y’). 

By the induction hypothesis, 


ee lf, y')det(Jo,, dy! = We (ey )dy”, 


and substituting this into (iii) proves the theorem for the change of variables 
defined by g. 
It remains to prove the theorem for h. Note that, by Fubini, 


Gin) fe evrwder(dnddcdy =f dy f o(s'é.ueuddet Inds 


But det Jj, = 9£ and, by 4.4.2, 


1 Of t.. 
fv (Gv. Fede = [ olnadan. 


The result follows by substitution into (iv). 0 


REMARK. This proof can be given the following geometric interpretation. 
Let p: R" — R, where p is differentiable and Vp #4 0 everywhere. Then 
the volume element dug» can be written locally as the volume element 
on the hypersurface p = constant, “multiplied” by TI The induction 
hypothesis allows us to treat the change of variable on the hypersurface; 
the other change of variable occurs in one dimension. 


5 Linear Functionals on Cx (X) 
and Signed Radon Measures 


In Section 2 we studied positive linear functionals on Cx (X). We now drop 
the hypothesis of positivity and substitute the more general hypothesis of 
continuity. 


5.1 Continuous linear functionals on C(X) (X compact) 


Throughout this section, X is a compact space. Then Cx (X) is the space 
C(X) of all continuous functions. A norm is defined on C(X) by setting 


Ifllc = max |f(x)]. 
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Convergence in this norm corresponds to uniform convergence. C*(X) 
denotes the Banach space of continuous linear functionals 1! on C(X); that 
is, those for which there exists a constant a such that 


(Ff) < allfile. 


Setting |[I[lc- = sup_|l(f)| yields I(f)| < [Id 
Wfllo<t 


5.1.1 Proposition. If 1 is positive, then | is continuous. 
PROOF. Indeed, 


file. 


C* 


—Ilflle < f(x) < Ilflle 


implies 

—||FilclAx) SIF) < Wf licl(x), 
whence 
(1) \lc- =11x).0 


5.2 Decomposition theorem 
Theorem. Let X be a compact space and let | € C*(X). Then there exist 
positive linear functionals I+ and I~ such that 

5.2.1 l=I*—-I- and 
5.2.2 WEllo~ = ll lox + II | 


ce 
and such a decomposition is unique. 
PROOF. For each nonnegative f in C(X), let 

H(f) = {we C(X):0<usf} 
and let 
(i) I*(f) =supl(u), where u€ H(f). 


Let fi, fo > 0. Since wu. € H(fi) and ug € A(fo), wi + ue € A(fi + fo); 
hence H( fi) + H( fz) C H(fi + fa). 

We now prove the opposite inclusion. Let u € H(f; + fo) be given. Set 
v = min{u, fi} = $(u+ fi — |u— fil); then v € C(X), v € H(f;,), and 
w=u-veé H(fo). 

Thus u=v+w with v € H(fi), w © H(f2), and we have shown that 


(22) A(fi + fo) = A(fi) + H(fe). 
This implies 
(222) I*(fi + fo) =U (fi) +14 (fo), fis fo 2 0. 


Any g € C(X) can be written as 
(iv) g9=7- 92 with g,g2>0. 


(For example, we can take g; = max(g,0).) 
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Define 
(v) I*(g) = I¥ (gi) — 17 (92). 


We will justify this definition by showing that the right-hand side is 
independent of the choice of the decomposition (iv). Let 


9=93—-94, 93,94 2 0. 

Then 91 — g2 = 93 — ga, or 91 + 94 = 93 + go. Using (iii), 

I* (gi) +17 (ga) = 7 (gs) + (ga), 
or 

I*(g1) — (ga) = I (gs) - I* (ga), 
which justifies definition (v). 

It therefore follows from (iii) and (v) that 
(g+g') = (9) +0°('). 
Similarly, it follows from (i) that 
It(af)=alt(f) if a>0, f>0. 


Since 0 € H(f), we have /*+(f) > 0 if f > 0, whence 
(vi) I+ is a positive linear functional on C(X). 


Setting /~ = 1+ — 1, we have I~ € C*(X). Furthermore, let f € C(X), 
f > 0. Then 


I (f) = (supl(u)) —U(f) = sup((u— f)), where ue H(f). 


For f > 0, set G(f) = {v € C(X) : —f < v < 0}. Then the mapping 
ut>u— f defines a bijection of H(f) onto G(f); hence [~(f) = supl(v), 
where uv € G(f). 

Since 0 € G(f), 17 (f) > 0 and we have thus obtained the decomposition 
5.2.1. 

Let 1 denote the indicator function of the full set X; then, by 5.1(i), 


Ie | 


on(x) =F (1) and | |lo«(x) = 17 (1). 
There exist u, € H(1) and v, € G(1) such that 

[*(1) =liml(u,) and I7(1) = liml(v,). 
It is straightforward to show that 


I los + Ir lle+ = lim (un) + Uen))- 
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We have 0 < un <1, -1 < vp, < 0, and -1 < up(z) + vp(x) < 1, or 
||Un + vnlle <1. 
Hence 
[U(un + Un)| < llllle-, 


and we have shown that 
IF los + Wr Iles < Illes. 


Since the opposite inequality follows from the triangle inequality, this 
proves 5.2.2. 0 


5.2.3 Uniqueness of the decomposition 


Let 
(i) l= yp — w where ¢, w are positive linear functionals. 


Then 
I*(f) =sup{y(u) — 4(u)} with ue H(f). 
But 
plu) — p(u) < vlu) 
since u > 0, and thus 


sup{y(u) — o(u)} < sup y(u) = ¢(f). 
That is, 
I“(f) < 9(f) for every f > 0. 
Set y —1* = 6; then @ is a positive linear functional, and it follows from (i) 
that 
(it) gp=lt+0 and w=l +8. 


Then |[pllow = (1) = I*(1) + (1) = [ll |]ce* + [!8llo*; similarly ||b||c+ = 
Il low + [16lles. 
Suppose that the decomposition y — w satisfies 5.2.2; then 


Illex = Ileller + Illes = IF lox + |e + 2l6lle~. 


Furthermore, by 5.2.2, ||l|lc* = ||I7 |lo= +||U" ||c+. Subtracting these two equations 
shows that 2||6@||c+ = 0; thus 9 =0. 0 


5.2.4 Corollary. Given 1 © C*(X), there are two Borel measures 11 and 
2 uniquely determined by 


Uf) = f fu. — f dys and 
Wl]o+ = i(X) + p2(X). 


PROOF. By the decomposition theorem (5.2) and the Radon-Riesz theorem. 
Oo 
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5.8 Signed Borel measures 


In this section, we establish the equivalent of Theorem 5.2 for Borel mea- 
sures. 


5.3.1 Definition. A signed Borel measure on the compact space X is a 
mapping 
vy:By ~R 


that can be written in the form 


(7) V(A) = pi(A) — pa(A), 


where /1, p2 are finite Borel measures. The decomposition (i) is clearly not 
unique; adding the same Borel measure 6 to p; and p2 will not change the 
mapping v. 


5.3.2 Mutually singular measures 


Two Borel measures 1, and 12 are said to be mutually singular if there 
exists a Borel set A € By such that 


(2) (A) =™4(X) and v2(A) =0. 


The relation is symmetric, for A‘ carries all the mass of vg and has 14- 
measure Zero. 


5.3.3 Theorem. If X is a compact space, there exists a bijection between the 
continuous linear functionals on C(X) and the signed Borel measures. The de- 
composition of a linear functional given in Theorem 5.2 corresponds to the de- 
composition of the signed Borel measure as a difference of two mutually singular 
Borel measures. 


PROOF. We use 5.2.4. The only statement still needing proof is the equivalence 
of the following two properties: 


(2) Ilo = pi(X) + p2(X). 
(ii) pi and p2 are mutually singular. 


We first show that (ii) => (i). If p1 and p2 are mutually singular, let A be an 
element of Bx such that 


pi(A) = pi(X) and p2(A) = 0. 
Set 
Q= la ae Lac. 
Then 
[ea =pi(X) and [vv = —p2(X), 
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whence 
[eta — dpz) = pi(X) + p2(X). 
By Theorem 3.5, we can find f € C(X) such that 
lf —¢llz: <¢, where L’ = L'(X,Bx,p1 + p2). 


Set 


f(x) = f(z) if |f(x)| <4; 
f(z) =signum f(z) if |f(x)| > 1. 
Then lf - yllp1 < 2e and 


/ (dp. —dpx) > pil X) + 2X) —2e, where fe C(X), Ilflle <1. 


Conversely, we show that (i) => (ii). There exists a sequence {in} in the closed 
unit ball of C(X) such that U(yn) — |[l|c*. Set yn = Yn — Y; then 


(en) = | | etan + f onan - | f exam + f etan| 3 


Since ys; <1 and vy, < 1, the first term in brackets is at most equal to pi(X) + 
p2(X) = |\llc« by (i). Hence the convergence of I((pn) to ||!||c~ implies that 


[ etn — pi(X) and [even >, 


Since 
[1 - gbllainy = [A-etdder, 


we conclude that 


Il1 — ga llzt¢1) 7 0 and ||Pn llz1(p1) — 9: 


Passing to a subsequence {Yn,}, we may replace the convergence in L* (pi) of 
{1—t} by convergence pi-a.e. Passing to a new subsequence {n, } reduces the 
proof to the case where 2), = yi, satisfies 


Ws converges tol pi-a.e.; 
ws converges to0 p2-a.e. 


Let 
A= {x: limp? (x) = 1}. 
Then 
1 —Valler(oy) =O and |[Lallz1(p2) = 9; 
or 
pi(A) = pi(X) and pe2(A) = 0.0 
5.3.4 Proposition. Let v be a signed Borel measure. Then there exists a 
decomposition 
v= pi — p> 
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such that p} and p§ are mutually singular. Such a decomposition is unique. 
We set 
Iv] = pt + p3 


and call |v| the absolute value of v. 


PROOF. Let a continuous linear functional on C(X) be defined by setting 


Ip) = fed = f odo - | edps. 


Then, by 5.3.3, the decomposition of v as a difference of mutually singular 
Borel measures corresponds to the decomposition of | given by 5.2.4. This 
decomposition exists and is unique by 5.2.4. 0 


5.3.5 Signed Radon measures 


Given a signed Borel measure v on the compact space X, let p? — p§$ be its 
canonical decomposition. Let B be the completion of the Borel algebra Bx 
with respect to |v|. We define a signed measure on B by setting 


u(B) = p(B) — p(B), VB eB. 


jt is called the signed Radon measure associated with the signed Borel 
measure Vv. 

If X is a locally compact space, a signed Radon measure v on X is given 
by two mutually singular Borel measures 1; and v2. We set |v| = v1 +v2 and 
define the o-algebra 6, by completing the Borel algebra Bx with respect 
to |v|. Then, if A € B, and |v|(A) < +00, we define v(A) = (A) — 2(A). 


5.3.6 Important remark on terminology 


Let X be a locally compact space. We denote by M(X) the vector space of 
signed Radon measures and by M+(X) the Radon measures on X; that is, 
the measures associated with positive linear forms. In the usual terminol- 
ogy, M(X) is called the space of Radon measures and M*(X) the space 
of positive Radon measures. From the point of view of grammatical accu- 
racy, this terminology is better than ours; a noun modified by an adjective 
should describe a narrower class of objects than the noun alone. Our use 
throughout Chapter I of the word “measure” to mean a positive measure 
may justify our ignoring this rule now. 


5.3.7 Complex measures 


We denote by C(X; C) the space of continuous complez-valued functions on 
the compact space X. Separating real and imaginary parts, we can write 


C(X;C) = C(X) @ C(X). 
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A C-linear functional | on C(X;C) is determined by restricting Re(l) to 
each summand of the direct sum. Since X is assumed compact, specifying 
lis equivalent to specifying two signed Radon measures 4; and jig. Setting 
Well = eal + [val] and 1 = pr + tpy2, we have 


fein) = f far — f hips +i f fag + hdy: 


[41 + tpg is called the complex measure associated with this form. 


5.4. Dirac measures and discrete measures 
5.4.1 Dirac measures 


Let X be a locally compact space X and let x» € X. The Dirac measure 
at Zo is the linear functional 


leo(f) = f(to), Vf € Cx(Xo). 


This positive linear functional is represented by a Borel measure 6,, whose 
completion is defined on the o-algebra P(X) consisting of all the subsets 


of X. We have 
6z,(A)=1 if rEA 
6¢,(A)=0 if ap € A. 


5.4.2 Discrete measures 


Now let 71,...,2;,... € X and a; € R. Suppose that, for every compact 
set K, 


(2) > la;|<+0o, where Sx ={j: 2; € K}. 
JESK 


A locally finite signed Borel measure v is defined by setting, for B € Bx, 
V(B)= > ay, where j € Sp = {j: 2; € Bh. 


This series is absolutely convergent by (i). Let vt = ast ajéz, and let. 
v~ = Yia,<o ~2j42;- Then v* and v~ are locally finite Borel measures. 


Completing the Borel algebra with respect to |vy| = vt + v~, we recover 
the o-algebra of subsets P(X); hence 


|v\(C) <+00 is defined VC € P(X). 


In contrast, v(C) is defined only for those C € P(X) which also satisfy 
|v|\(C) < +00. 

We denote by Ma(X) the discrete measures on A and by M}(X) the 
finite discrete measures: Mj(X) = M1(X)N Ma(X). 
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5.5 Support of a signed Radon measure 


5.5.1 Definition. Let u € M(X). The support of , written supp (jz), is 
the smallest closed set F’ such that |j|(F'°) = 0. Let us show that this set 
exists. Taking complements, finding Fis equivalent to finding the largest 
open set Hf such that |u|(H) = 0. 

The hypothesis 3.0.1 implies that X satisfies the second separability ax- 
iom of I-2.4.1. Therefore we can find a countable family of open sets O, 
which forms a basis for the open sets. Set 


S={n:|ul(O,) =O} and AH =UnesOn. 


Then H, as a countable union of sets of measure zero, has measure zero: 
\n|(H) =. 

Let O’ be an open set such that |y{(O’) = 0; then O' = UnerO,, (since 
{O,,} is a basis for the open sets). The hypothesis |,:|(O’) = 0 implies that 
[]((On) = 0, whence TC S and O' C H. 


5.5.2 Proposition. Suppose that X is a locally compact space, f € Cx(X), 
and 4 € M(X). Then 


| tae=o if supp (f) supp (1) = 0. 


PROOF. Let p= fy — fg with |u| = py + fe, and let H = (supp (j))°. 
Then f = 0 |pl-a.e., whence f = 0 ae. fj, 7 = 1,2, which implies that 
f fap: = 0, i= Led. 


6 Measures and Duality with Respect 
to Spaces of Continuous Functions 
on a Locally Compact Space 


6.1 Definitions 


We consider the following three vector spaces of continuous functions on 
Xx: 


Cx(X). the continuous functions with compact support; 
Co(X), the continuous functions which vanish at infinity; and 
C,(X), the bounded continuous functions. 


(i) Recall that a function f is said to vanish at infinity if, for every € > 0, 
there exists a compact set K such that |f(x)| < « for « € K. We have the 
following inclusions: 


(77) CK(X) C Co(X) C Or(X). 
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If X is compact, these three spaces coincide; if X is not compact, each 
of the inclusions is strict. A norm is defined on C,(X) by setting, for f € 
Cr(X), 


(ii2) IIfllc, = sup |f(2)|, ve x. 


This norm defines, by restriction, norms on Co(X) and CxK(X). The re- 
striction of || ||c, to Co(X) will sometimes be denoted by || |[c,. We then 
have 


(iv) |h\lcg = max |A(x)|, 2 © X. 


The difference between (iii) and (iv) is that, although the supremum may 
not be attained in (iii), it is attained in (iv) and gives a maximum. 


6.2 Proposition. The space C,(X) equipped with the norm (iii) is com- 
plete. The space Cy(X) is a closed subspace of Cy(X) and is therefore com- 
plete. The space Cx(X) is a dense subspace of Co(X). 


PRoor. Only the third (and hardest) assertion will be proved here.” 
Let {H,} be the exhaustion sequence of compact sets constructed in 


1.0.3. Recall that Hy, C oe For each n, let Yn, Wn be a partition of 


io) 
unity subordinate to the open cover consisting of the two sets H,+1 and 
H<. Then, since yn + Yn = 1 on X, 


Yn =1 on Ay. 


Given h € Co(X), set hn = hyn. Then hy € Cx (X) and |[h — halleg = 
\|hvnllc, ~ 0 as n + oo, since supp (Yn) C Hf and h(x) — 0 as x tends 
to infinity. O 


6.3 The Alexandroff compactification 


Given a locally compact space X, we can associate with it a compact space 
Y and a homeomorphism w of X onto Y with one point removed. Y is 
called the Alezandroff compactification of X. The construction consists of 
adjoining a point at infinity to X by setting Y = X U {oo}, where oo is 
a new element. The complements of compact subsets of X are taken as a 
system of open neighborhoods of oo. 


Having thus defined Y from the set-theoretic point of view, we now construct 


a topology on Y in a more precise way by specifying its closed subsets. 


?For the first two, see for example E. Hewitt and K. Stromberg, Real and 
Abstract Analysis, 3rd ed. (New York: Springer-Verlag, 1975). 
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A subset F’ of Y is closed if and only if it satisfies the following conditions: 


(i) FMX is closed; and 

(ii) if FO X is not compact, then oo € F. 

Let p be the injection of X into Y; then, by (i), p~'(F) is closed in X if F is 
closed in Y. Let H be closed in X; then, by (ii), H or H U {oo} is closed in Y. 
Intersecting with {o0}" shows that H is a relatively closed set in {oo}°. Thus p 
is a homeomorphism of X onto {oo}. 

The open neighborhoods of oo are the complements of closed sets that do not 
contain oo; that is, the complements of compact subsets of X. It follows easily 
that the topology of Y is Hausdorff. 

We now show that Y is compact. Let O, be an open cover of Y. There exists 
Yo such that oo € O,,; hence there exists a compact set K such that Oy = K°. 
The sets O..9 A form an open cover of K. Let O,,  K,..., Oy, AK be a finite 
subcover. Then O,,, ..., O,,, form a finite subcover of Y. 


6.4 Proposition. Let X be a locally compact space and let Y be its Aleran- 
droff compactification. Set 


V={f eC(Y) : f(c) =O}. 
For every function f € V, let f denote its restriction to X. Then 
| 
is a linear mapping which is an isometry of V onto Co(X). 


Proor. Let f € V:; then the restriction f of f to X defines an element 
ee C,(X). Furthermore, since f is continuous at oo, for every € > 0 there 
exists a compact set A’ such that |f(z) — f(oo)| < ¢ if « ¢ K. Hence 
f € Co(X). 

Conversely, let h € Co(X). Then h can be extended to Y by setting 
hi(co) = 0 and setting hi(x) = A(x) if x € X. Since h € Cy(X), hy is 
continuous at the point oo and hence continuous everywhere. 0 


6.5 The space M'(X) 


(i) We denote by '(X) the set of signed Radon measures v on X such 
that |v| is finite, and define a norm on M1(X) by setting 


lan = ' dlv| = |vI(X). 


Moreover, for every Borel set A of X, v(A) = 1(A) — 2(A) is well defined. 
(See 5.3.5.) 


(ii) Proposition. Let Y be the compactification of X and let 


W = {v © M(Y): v({co}) = O}. 
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Let a mappingv > Vv 
Ww M'(X) 


be defined by setting 
v(A)=v(A) VAE Bx C By. 
This mapping is an isometric bijection of W onto M'(X). 


PROOF. It suffices to note that every B € By can be written either as 
B = AU {oo} or as B = A, for some A € Bx. In the first case, the 
additivity of v gives v(B) = v(A) + v({oo}) = (A) since v({oo}) = 0. 0 


6.6 Theorem. M1(X) is the Banach space dual of Co(X). 


PROOF. With the notation of 6.4, Co(X) ~ V c C(Y). Let 1 € C*(Y); 
then its restriction to V defines a continuous linear form on V. By the 
Hahn-Banach theorem, every linear functional on V can be written in this 
way. Thus 

(V)" ~ O'(Y)/H, 


where H is the space of linear functionals which vanish identically on 
V. Since V has codimension 1, H has dimension 1 and is therefore the 
vector subspace generated by 6.., the Dirac measure at infinity. But, in 
the notation of 6.5(ii), W ~ M(Y)/H, whence (Co(X))* ~ M(Y)/H = 
W ~ M!(X). All these identifications are isometric. In particular, for every 
we MX), 


sup f faye = [lallan0 
lflleg <1 
6.7 Defining convergence by duality 


The following three spaces of continuous functions are associated with a 
locally compact space X: 


Cx(X) C Co(X) C Co(X). 


Convergence in M(X). A sequence {tin}, Un € M(X), is said to converge 
vaguely to fo € M(X) if 


(2) / fdjin > / fduo, Vf € CK(X). 


Convergence in M1(X). Given a sequence {vp}, Un € M1!(X), we have 
two new concepts of convergence. 
Un is said to converge weakly to vo if 


(it) [navn at [ rave, Vh € Co(X). 
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Vy, is said to converge narrowly to Vm if 
(tit) [keen => [ kv Vk € Cy(X). 


Since M'(X) C M(X), vague convergence can be defined on M1(X) as 
well. Thus M1!(X) is provided with four notions of convergence, which 
imply each other according to the following diagram: 


(convergence in norm) = (narrow convergence) = (weak convergence) => 
(vague convergence). 


6.8 Theorem. Let pi, € M1(X). Consider the following statements: 
(1) {fn} converges weakly. 


(it) {Un} converges narrowly. 
(iit) There exist a constant c and a dense set DC Co(X) such that 


linllags <¢ and [sen converges for every g € D. 


(iv) For every € > 0, there exists a compact set K such that 
len |(°) <e for all sufficiently large n. 


(v) Each 1, is positive, {,} converges weakly to pu, and f[ din > 


fdp < +00. 
Then 
(iti) © (i), 
(i) and (iv) = (ii), 
(v) => (ii). 


REMARK. To simplify the exposition, we prove only the direct implications, 
which are the easiest; these are practically the only ones used in what 
follows. 


PROOF THAT (iii) = (i). The family of linear functionals on Co(X) 


nf = f Fun 


satisfies 

n(f — FI < ellf — filleo: 
It is thus an equicontinuous family. Since it converges on a dense subset 
D, by Ascoli’s theorem® it converges on all of Co. Let lo(f) = limln(f). 
Using 6.6, we find that J, is defined by a Radon measure wx. € M1(X) 
and that {,,} converges weakly to uw... 0 


3See Bourbaki, General Topology, X.2.5. 
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PROOF THAT (i) AND (iv) = (ii). Let f € C,(X). Let K be the compact 
set determined by (iv), and let y denote a function with compact support 
such that y(z) = 1 if € K. Then f = fy+tu, where supp (uv) c K°; 


hence 
i fille i fiodtin + a tid fin: 


By (i), the first integral converges to f yfdjiso, where dja. is the weak 
limit of {du}. Moreover, 


| 1 udfin 


PROOF THAT (v) => (ii). It will suffice to prove that (v) implies (iv). Given 
€ > 0, let K be a compact subset of X such that p((K°)) < «. Let f bea 
function with support contained in K such that 0 < f <1 and 


S Jlulle, lun |(K°) < ell fl]-0 


jf fae> Wu 


Let no be such that, ifn > no, 


[ain [ay] <e and [ff aun — [4 dul <e 


pin(CRD) < Noll — fF din whence pin ((R)) < Be ifn > no 


Then 


O 

6.9 Theorem. Let X be a locally compact space and let Mj ,(X) denote 
the finite linear combinations of the Dirac measure on X. Then, for any 
uw € M'(X), there exists a sequence {pn}, tn € Mj ,(X), such that {tn} 
converges narrowly to .. 


PROOF. Let {yn} be an increasing sequence of functions with compact support 
such that 0 < yn < 1 and limgyn = 1. Then |lynpu — pl|yg1 — O by Lebesgue’s 
dominated convergence theorem. Hence it suffices to prove the theorem when yu 
has compact support K’. Let {On,; : 7 € [1,$n]} be a finite cover of K by balls of 
radius +. Let Ana = Ona, Anz = Ona O83, and set Ang = Ang MK. Then 
each A,,q has diameter < + and the A,,q form a partition of K. Restricting to 
An.q #9, choose n,q € Ang: 
Ln is constructed by setting 


Un = ye U(Anq) beng 
q 


Let f € Cy(X); then 


pred f du. 


An,q 
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Since f is uniformly continuous on the compact set K, there exists a sequence 
{tm} which tends to zero as n — oo and satisfies 


|f(x) — f(e’\|< mm if d(z,2!) <>. 


Hence 


i ane yee ene ‘| icra teal aw, 


where |@,| < 1. Summing over gq gives 


[ram fr dun 


S m|HI(K). 
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Fourier Analysis 


Introduction 


Fourier analysis can be illustrated by analogies from optics. Given a light 
beam, the goal of spectral analysis is to determine the monochromatic 
beams it contains; that is, the beams of the form exp( 2). Once a spec- 
tral analysis has been carried out, one can ask whether the analysis is 
exhaustive: is all the energy of the beam really concentrated in the band of 
frequencies where the spectral analysis was done? One can also ask whether 
the beam can be reconstructed from its monochromatic components: can 
spectral synthesis be performed? 

It is well known that quantum mechanics determines the possible energy 
levels of a system as the eigenvalues of a hermitian operator defined on 
a Hilbert space H. More generally, given a system of pairwise-commuting 
hermitian operators, the eigenvalues of the system are the possible values 
of the associated “observables” . 

In the general setting of spectral theory, the problems of spectral analy- 
sis, conservation of energy, and spectral synthesis remain completely mean- 
ingful. Taking the space L?(R”) as a Hilbert space H and the hermitian 
operators generated by the translations as a family of operators, one nat- 
urally recovers Fourier analysis as a special case; what is more surprising 
is that. general spectral theory can be obtained as a classical theorem of 
Fourier analysis, Bochner’s theorem. This will be done in Appendix I. 

Since differentiation operators on L?(R”) appear as limits of translation 
operators, Fourier analysis realizes their spectral decomposition as well. 
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Thus partial differential equations with constant coefficients are subject to 
the methods of real Fourier analysis (or complex Fourier analysis, but we 
will not pursue this point). 

Studying the domains of definition of the Laplace operator and its iter- 
ates in L?(R") leads to the construction of Sobolev scales, a theory that 
is stable under local diffeomorphisms and thus well suited to the local the- 
ory of partial differential equations with variable coefficients. In dealing 
with the theory of distributions, we use the approaches of Sobolev and 
Schwartz simultaneously. The chapter ends with the local inversion of el- 
liptic operators with variable coefficients, by means of Calderon’s theory of 
pseudo-differential operators. 


1 Convolutions and Spectral Analysis 
on Locally Compact Abelian Groups 


1.1 NoraTion. Let G be an abelian (commutative) group . The group 
operation will usually be written additively: 
(91-92) > 91 + 92. 


With this notation, the identity element will be denoted by 1 and the 
inverse of g by —g. 

A locally compact abelian group is an abelian group which is given the 
structure of a locally compact topological space compatible with the group 
operation. That is, the mapping from G x G to G' defined by 
(2) (91.92) > 91 — 92 
is continuous. It can be shown that a metrizable group G has a translation- 


invariant metric d; that is, d satisfies 


(ii) d(go +. 9.90 + 9') = d(g.g'). 


1.2 Examples 
1.2.0 The integers Z form a group under addition. Given the distance de- 
fined by d(n,m) = |n — m|, they form a locally compact group. 


1.2.1 R”, with vector addition, is a locally compact group. 


1.2.2 The one-dimensional torus 


Let 
T={zeC:|z|/=1}. 
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T is the set of complex numbers of modulus 1. (From the set-theoretic 
point of view, T is a circle.) T is given the group operation defined by 
the multiplication of complex numbers. If 21, z2 € T, then 2;z2 € T and 
2, 1 — 3, € T. Thus T is an abelian group; when endowed with the topology 
induced by C, it is compact. 

1.2.3 The n-dimensional torus 

We denote by T” the product of n copies of T, endowed with the product 
topology and the product group operation. 

1.2.4 A homomorphism from R onto T 


With 0 € R, we associate the element 
u(0) =e eT. 


Then u(O + 6’) = u(6)u(@’), ie. u is a homomorphism of R onto T. The 
kernel of wu is 
u'(1) = {6:e = 1} = 2nZ, 


where Z is the subgroup of R consisting of the integers. Let C(T) denote the 
functions defined and continuous on T and let C,(R) denote the bounded 
continuous functions on R. Let u* be the map from C(T) into C,(R) 
defined by 

(u" f)(@) = f(u(@)), VOeR. 


Then the image of u* consists of those functions h € C,(R) that are periodic 
with period 27; that is, functions satisfying 


h(6 + 27) = h(6). 
1.2.5 A homomorphism from R” onto T” 
With x = (21,...,2n) we associate 
eS ewe), 
The kernel of v is 27Z”. The operation 
fr fov=v'f 


maps C'(T”) onto the n-fold periodic functions on R”; that is, functions h 
satisfying 
h(a+y)=A(x), Vy € (20Z)”. 


1.3 The group algebra 


M'(G) denotes the Banach space of signed Radon measures on G which 
have finite total mass. 
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1.3.1 Discrete measures 


Let 6, denote the Dirac measure at the point g and let 
+90 
M}(G) = \! EM'(G): = 5° Brbdg,. where S~ || < +0} 
k=1 


1.3.2 Convolution in Mj (G) 


The convolution of two Dirac measures 6,, and 6,, is defined by 
bg, * Ogo = Og +g2- 


That is, the convolution product is the Dirac measure at the point g; + go. 
This definition is extended to Mj(G) by bilinearity. Given w = Y> Bxdg, 
and pi! = Y7 31.59 in Mj, we set 


pel = S> BrBsOq, 495° 
ks 


Note that the convolution product is commutative, associative, and bi- 
linear: 
ew = pl * py, 
(up!) «wl = wx (ul sp"), 
(utvjep =prp tos, 


Moreover, 
Ie H'laecay < D> Be Bil = (= i! (= i = ell le’ 
ks k s 


(Strict inequality can occur only if gx +g) = ge +94, with (k,s) 4 (k’, s’).) 
We would like to extend the convolution operator from Mj}(G) to all of 
M? by an explicit formula realizing this extension. Let Co(G) denote the 
continuous functions on G which vanish at infinity. 


1.3.3 Fundamental lemma. Let pu, pu’ € Mj(G) and let p = u* p’. Then 


[sede =f f tet wane. vf € Col). 
G GJG 
PrRoorF. The right-hand side, which we denote by //, can be written as 


IT=S° fon + 94.) BnB- 


kk’ 
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Grouping together all terms such that 9% + 9, = gy, we obtain 
=the) Sl hes / ANd): 
Ie+9,0 =I 


1.3.4 Definition of the convolution product on M1(G). Given v, v’ € 
M}(G), we define a linear functional on Co(G) by setting 


lf) = a d fle + y)dv(x)dv'(y). 


This integral converges, since 


NAL < i. [ fle +y)I ldv(2)| |dv(u)I < MNFlles [ [ dd@ dN, 


where A = |y|, ’ = |v’|. By Fubini, 


[ [ dd @ dX’ = (G)A'(G) = [lvl] [lela 
GJG 
and hence 


(i) WA) S WF llcoceyllvllae: lela 


Thus / is a linear functional on Co(G) which is continuous in the norm 
topology. By II-6.6, there exists a measure 0 € M1(G) such that f fdo = 
l(f). We set 0 =v *v’, and call o the convolution product of v and v’. 


1.3.5 Theorem (Properties of the convolution product). Let G be a 
locally compact group and let M1(G) be the Banach space of finite Radon 
measures on G. The convolution product is defined on M1(G) by the for- 
mula 


(i) i f(2)aX(2) = i i fle + y)d(a)de'(y), Vf € Co(G), 


where v, v’ € M'(G) andX\ =v «v’. 


It has the following properties. 


(it) Iv *"|| < [lol te" 

(itt) vev'=v' xv (commutativity) 

(iv) (v«v')*xv" =vx(v'«v") (associativity) 
(v) (V+n')av" =ve+vx«v" (linearity) 
Furthermore, 


(vi) if {un} and {v/,} converge narrowly to vo and 44, then vp xv}, converges 
narrowly to Vo *%. 
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PROOF. Formula (ii) follows from 1.3.4(i). In order to prove (vi), note that 
the narrow convergence of v, and vj, and Fubini’s theorem imply that 
V, © vi, converges narrowly to vp @ vj. Let f € Cy(G) and set u(x, y) = 
f(a+y). Then u € C,(G x G), and 


lim 7 u div, @dvj, = / u dy ® dr% 
Gx@ GxG 


can be written as 


tim ff Fe + v)avale) @ dui (y =f [fe + wan xr) @ dy), 
Vf €C(G). 
Thus (vi) is proved. 
The algebraic properties (iii), (iv), and (v) can be proved by passing to 
the limit and using (vi), since these properties hold on Mj(G) by 1.3.2. By 


11-6.9, M}(G) is dense in the topology of narrow convergence on M!(G). 
(Or this could easily be proved directly.) 


1.3.6 Support of the convolution product 


If F, and Fy» are subsets of G, we set 


Fit+ Fo={g:g=91+ 92 with g; € Fj}. 


Proposition. Let v;, v2 € M'(G). Then supp (vy. *v2) C supp(™) + supp (V2). 


PROOF. [oe + y)dv;(x)dve(y) = 0 if y is zero on supp (1) + supp (v2). 
i) 


Equality holds if both measures are positive. 


1.4 The dual group. The Fourier transform on M! 
1.4.1 Characters 


Let G be a locally compact abelian group and let T be the multiplicative 
group of complex numbers of modulus 1 considered in 1.2.2. A character 
on G is a mapping 

x:GoT 


such that 


(i) xy is continuous, and 
(ii) x is a homomorphism: y(g + 9’) = x(g)x(g’). 
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1.4.2 The dual group 


The set of characters of G is denoted by G, and is given a group structure 
by defining the product v3 of two characters x; and x2 as follows: 


x3(9) = x1(g)x2(9), V9 EG. 
The inverse x4 of x1 is defined by the formula 


a Y 
x4(9) = x1(9) = alan 


Thus G is an abelian group. The identity element is the trivial character 
xo defined by 


xo(g)=1, Vg EG. 
1.4.3 The Fourier transform on M1+(G) 


Given » € M1(G), we assign to it a function defined on G by 


i= a x(g)du(g). 


i is called the Fourier transform of pu. 


1.4.4 Fundamental theorem (Trivialization of the convolution 
product). Let u,v € M'(G). Then 


[L*xV = pid; 


that ts, the Fourier transform maps the convolution product of measures to 
the usual product of functions. 


PrRooF. Let p = w*v. Then 
iE x(2)dp(2) = [ p x(a + y)dp(e)dv(y) 
/ | x(z)x(y)du(x)dv(y) 
GJG 


= ( ii x(@)du(e)) ( I x(v)av(u) 


B(x)P(x)-0 


The first equality follows from 1.3.5, the second from the identity x(r+y) = 
x(x)x(y), and the third from Fubini’s theorem. 


REMARK. Let 69 denote the Dirac measure concentrated at 0. Then 


do(x) =x(0)=1, VWrEG. 


I 


Moreover, 
box =n, Vue M1(G); 
that is, 69 is the identity element of the algebra M1(G). 
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1.5 Invariant measures. The space L' 
1.5.1 Translation-invariant measures 


A measure p € M1(G) is said to be translation invariant if 


(i) i f(@-+g0)du(a) = j fla)du(g), Yoo € G. 


1.5.2 Proposition. Suppose that yu satisfies (1) and that G is compact. 
Then 


(ti) E(x) =0_ for every nontrivial character. 


PROOF. Let x be a nontrivial character. Then there exists gg € G such 
that x(go) # 1. Condition (i) can be written in the form 


(itt) bg * = pl. 


(iv) Since G is compact, u(G) < oo and thus p € M'!(G). Under these 
conditions, 1.4.4 can be applied: 


(69) * (0) = b96 00200 = x(go) A(X), 


whence 
x(90)#(x) — A(x) = 0 => f(x) = 0.0 


1.5.3 Corollary. Suppose that G is a compact group, pu is a translation- 
invariant Radon measure on G, and L?(G; 1) is the associated Hilbert space. 
Then any two distinct characters of G are orthogonal. If the measure yu is 
also normalized by the condition 


pant. 


then the characters of G form an orthonormal system. 


PROOF. Given x1, x2 € GC we evaluate 


(x11Xx2) 22 =| x1(g)x2(g)du(g). 
ene G 
x1(9)x2(9) = x1(9)(x2(9)) 7! = x3(9), 


where x3(g) € G. By 1.5.2, the integral { xs(g)du(g) is zero if x3 is not 
identically equal to 1, that is if x; 4 x2. Finally, if is normalized, 


ballZ2 = [ aloxaWiaeta) = f auto) =10 
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1.5.4 Haar’s theorem. Let G be a locally compact abelian group. Then 
there exists a translation-invariant positive Radon measure [19 on G, and 
this measure is unique up to a multiplicative constant. 


REMARK. If yg is an invariant measure and c is a positive constant, it is 
clear that cup is an invariant measure. 

We assume without proof this general theorem of Haar, and restrict 
ourselves to constructing invariant measures in the special cases of the 
groups R, T, and Z. 


1.5.5 Examples of Haar measure 
(i) Counting measure on Z 


Let Z be the set of integers. Consider the measure zp such that 
o({n}) =1 forevery ne Z. 


Then po is translation invariant. 
(ii) Lebesgue measure on R 


Let R. be the additive group of real numbers. The Lebesgue measure po 
is translation invariant (II-4.2.6) and hence a Haar measure. 


(iii) Haar measure on T 


Let » : R — T be defined by setting 
y(0) =e”. 
Let a mapping ao : T > R be defined by 
o(¢)=arg¢, where arg ¢ € (0,27). 


Then o(¢) is a Borel mapping from T into R. Set 


vo(A) = 5 po(0(A)): 


then Vv is a Borel measure on T. Moreover, 


Qn : dé 
i, fd = fe). “anid [em =1. 
T 0 20 


Lemma. The measure VY is translation invariant. 


PRooF. Let 69 € [0, 27) and set 


Qn O694+27 Qr 2r+0 
F dé ° 2, ar : 
Igo, = i(8+00) \ * = | iA) 2A —} i | f 
ra } ‘ (° ) 27 Jo, er 2x Jo 7 Qn 


0 
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Setting A — 27 = u in the last integral yields 


Oo +270 a 
2 .\, dA o du 
irX = iu 
[se ge [ reng. 


| f dvo = Ip,.0 
T 


Uniqueness of the Haar measure in (i) is clear. For case (iii), it will be 
proved in 2.2.8. 


whence 


(iv) The product structure 


The measures on Z", R"”, and T” are the products of the Haar measures 
on each factor. 


1.5.6 Notation 


The Haar measure of the group G will be denoted by dg. If G is locally 
compact, this measure is defined up to a normalizing factor. If G is compact, 
the factor is chosen so that G has measure 1. 


1.6 The space L'(G) 
1.6.1 Identification of L'(G) with a vector subspace of M1(G) 


We denote by L1(G) the space of functions integrable with respect to Haar 
measure on G, and define an injection 


j: L(G) > M\(G) 
by associating with the function f € L'(G) the Radon measure 
(7) up = F(g)dg. 
1.6.2. The convolution product on L1(G) 


Proposition. Let f, h € L'(G) and let ys and pup, be the Radon measures 
associated with them by 1.6.1(i). Then there exists k € L1(G) such that 


(i) Mp * in =e (L'(G) is a subalgebra of M'(G)). 

k is defined by 

(i) Kao) = f F(o0 - a)h(adg = [hao ~ a) fla). 

where the two integrals converge almost everywhere in go with respect to 


Haar measure. We write 


k=fx*h. 
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(iii) REMARK. Since the convolution product on L'(G) is the restriction of 
the product on M1(G), it satisfies the identities 1.3.5(ii) to (v). 


PRooF. Let y € Co(G); then 


(2, Mf * Uh) = a i: (gi + 92) f (g1)h(g2)dgidge. 


Using Fubini’s theorem yields 


(0, My * Hn) = [Monde iff plgi + on) )das| 


Set g1 = 93—g2 inside the brackets. Since dg is invariant under translation, 
dg, = dg3 for fixed g2, whence 


(p, 65 * tn) = . h(ga) don | i: olgs)F(9s — onda 
Using Fubini again, we obtain 
(Q, My * Ma) = [ elasdees i h(g2)f (93 — on)doa| : 


Fubini’s theorem implies that the integral in brackets converges for almost 
every g3 and is an integrable function k € L1(G). We have thus shown that 


(p, Hi * M2) = | elo Kaa). 


1.6.3 The Fourier transform on L! 


The Fourier transform on L! is obtained by restriction from the Fourier 
transform on M? and thus is written 


(i) fo = [ fla)x(g)dg, Wx € @. 


Theorem 1.4.4, on the trivialization of the convolution product, gives by 
restriction 


(iz) FRO) = FODKOD). 


1.6.4 Bessel’s inequality. Let G be a compact abelian group and let 
f € L°(G). Then f € L'(G) and 


Ifl2ae = >> FOOP. 


xeEG 
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PROOF. Since p(G) < +00, 1-9.6 implies that L1(G) > L?(G). Moreover, 


~ 


fd) = FX) z2- 


Let S be a finite subset of GC, let Vs denote the vector subspace generated 
in L*(G) by {vy : x € S}, and let fs denote the orthogonal projection of f 
onto Vg. Then f = fs + f — fs, where f — fs is orthogonal to fs. Hence 


lIfllZ2 = lfsllZ2 + If - fsllz2. 
and therefore 
IIfllZ2 = lfsllze- 
But it follows from 1.5.3 that 


fs =o x(fbo = 35 foc')x and 


xES xES 


lfslz2 = So FOC) P.O 


xES 


1.7 The translation operator 
1.7.1 The translation operator on L?(G) 


Given a function f defined on G and a fixed go € G, we denote by 7,, f the 
function defined by 


(7) (Tg. f)(9) = f(g — go): 


By the translation invariance of dg, f € L?(G) implies (7, f) € L?(G), and 
moreover 


(ii) Ito f lle = Wf lle. 
Furthermore, 
(tit) Tg, O° Tg2 = Tg1 492° 


We summarize the last identity by saying that g > 7, is a representation of 
G in L?(G); that is, the mapping is a homomorphism of G into the group 
of linear automorphisms of L?(G). We define the translate of a set A by an 
element go of G to be 7,,(A) = A+ go. 

If u4 is the indicator function of the set A (ua(z) = 1 if  € A and 
ua(z) = Oifx ¢ A), then 7,,(ua) = Ur, (4): 


1.7.2 Fundamental theorem (Trivialization of the translation op- 
erator on L'(G) under the Fourier transform). Let f € L'(G). Then 


Tol (x) = x(go) f(x) Yr eG. 
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PROOF. 


Tal (x) = [ f(g - 90)x(9)dg. 


The change of variables g ++ g— go = g' leaves the Haar measure invariant: 
dg = dg’. Making this change of variables gives 


[to ~ go)x(g)dg = J f0o)x6a! + 90)d = x(a0) f #(o')x(g)o'.0 


1.7.3 Continuity of the translation operator 


Let Cx(G) denote the compactly supported continuous functions on G, 
with the norm 
WF licx = max |f(g)|, ge G. 


Continuity theorem. (i) Let f € Cx(G). Then the mapping from G to 
Ck (G) defined by g++ Tgf is uniformly continuous. 

(it) Similarly, let u€ L?(G), where 1 < p < +00. Then the mapping from 
G to L?(G) defined by g +> Tgu is uniformly continuous. 


PRooF.(i) Since f is continuous and compactly supported, f is uniformly 
continuous. Given € > 0, there exists 7 such that 


\f(g1) — flg2)l<e€ if d(gi,92) <n. 


Hence 


IT9.(F)(9) -T9 (F)(9)| = |F(9-90)-flg—go)l<€ if d(g—go,g-9) <0. 
But it follows from the invariance of the distance under translation (cf. 
1.1(ii)) that d(g — 90,9 — 96) = 4(go, 95), whence 
IIta0(F) — Te (Alo, <€ if d(go. 90) < 7-0 
(ii) We now consider the case where u € L?. Since p < +00, by II-3.5 there 
exists f € Cx(G) such that || f — ullz» < 4. Let us write 
Tgt —Tgtt = Tf —Ty f +79 (f —u) — T9(f — u). 
Using 1.7.1(ii), 
€ 
Irolf - w)llur = IF - ular < §, 
whence 9 
\|T9u — Tgtl|Le < 3° + |ltof — To fll ze. 
Let A = supp(f). Then 
supp (Tf — Tf) C Tg(A) U Ty’ (A), 
meas (supp (Tf — Tq f)) < 2 meas (A), 
IItof — To fllze < \Itaf — T'fllc,(2 meas (A))1/?. 


The right-hand side of the last inequality tends to zero as d(g, g’) — 0 by 
the first part of the theorem. O 
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1.8 Extensions of the convolution product 


In this section, we give other cases where formula 1.6.2 converges. 


1.8.1 The convolution product and the dual pairing 
Let f denote the function defined by 


Formula 1.6.2(ii) can be written formally as 


(i) k(go) = (Ta f+) = (f.Toah)- 
Lemma. Let f € L?(G) and h € L4(G), where 1 < p < +00 and p and q 
are conjugate exponents. Then, for every go € G, the integral 


(ii) F) f(ao — 9)h(g)dg 


converges and defines a function k(go) which is uniformly continuous and 
bounded and which satisfies 


(iii) lle, S WFllze llAlice. 


PROOF. By symmetry, we may assume that p < q; then, since p and q are 
conjugate, 1 <p < 2. 
Using (i), we have 


CO) = [ro FAY S ly Fllco Pla = [flloollPllca, 


and moreover 


[k(90) — Ri) = Mao — Ta FP) S Woo F — Tas Fille Alco. 
Since p < +00, it follows from 1.7.3(ii) that the first term tends to zero 
when d(go. 91) — 0. 


1.8.2 Theorem (Action of M'(G) on L?(G) (1 < p < +00)). Let 
w€ M'(G) and let f € L?(G). Then the integral 


(2) h(go) = [foo — g)du(g) 


converges almost everywhere in go with respect to Haar measure and defines 
a function in L?. Furthermore, 


(%i) Pllc» S WF llze (ella. 
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ProorF. Let f’ = |f| and let yu’ = |u|. Let u € L7, u > 0, and consider the 
double integral 


[= [ [te — g)u(go)dgo du'(9). 
Choose a Borel representative of the equivalence class of f in £?(G). For 
this fixed choice, f’(go — g) is a Borel function and hence measurable with 


respect to the product measure dgp © dyi(g). Thus Fubini’s theorem can be 
applied once we have shown the convergence of 


puto if f'(g0 — 9)u(go)dgo| - 


By 1.8.1, the integral in brackets is convergent and bounded above by 
Ilf lz |lullz2, whence 


(iii) IT} < lela ll fllze [lel ce. 


Letting u equal the indicator function of a compact set K, it follows from 
Fubini’s theorem that the integral (i) converges dg-almost everywhere on 
K. Since K is arbitrary, (i) converges dg-a.e. on G. Let h(g) be the function 
thus obtained. By (iii), 


[[ mayuta)da < alae, Ve € 28 
If p > 1, then q < +00 and we define a linear functional on L4(G) by 
K(u) = f h(g)u(g)ds. 


This form is bounded, since |/(u)| < C|lul|z7. By the duality theorem (IV- 
6.3), it follows that h € L?. If p =1, take u(g) = sign(h(g)) if h(g) 4 0 and 
u(g) = 0 otherwise. Then (iii) implies that 


J iPadldg < Wallarslfllzs < +00. 


Thus h € L!. 


1.8.3 The translation operator as a convolution operator 


Note that if yu = 6,,, then 


ac — g)du(g) = f (90 — 91) = (Ta f) (90). 
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In particular, tf = f. Thus the action of M1(G) on L1(G) is a general- 
ization of the translation operator. More generally, if jz € Mj} (cf. 1.3.1), 
then pp = >> Gpdy, and 


[tw — g)du(g) = eS G79. f) (90). 


Notation. Let  € M'(G) and f € L?(G) (1 < p < +00). We make the 
notational convention that 


(rf)(90) = i f(go — g)du(g). 


Then 
tu(Alce < lellae Wf lize. 


1.9 Convergence theorem. Let {jin} be a sequence of measures in M!(G) 
satisfying hypotheses (iii) and (iv) of Theorem II-6.8 and converging nar- 
rowly to v. Then 


lt,f —wWAf\lbe 70, Vf EL, 1<p<-+o. 


If in addition f € Co(G), then ||T, f — tf llcy > 0. 


REMARK. Using the converse of Theorem II-6.8, it would suffice to assume 
that {u,} converges narrowly to v. Because this converse was not proved, 
we prefer to give the rather awkward statement above. 


PROOF. Since p < +00, we can find h € Cx(G) such that 
lf — Allae <e. 
By hypothesis II-6.8(iii), sup || un ||ag1 = ¢ < +00, whence 
IITun (F— Plier S Wenllars lf — Alize < ell f — Allee. 
It thus suffices to show that 
(2) IT, — TA||L» — 0. 


Hypothesis II-6.8(iv) implies that for every « > 0 there exists a compact 
set H such that, for sufficiently large n, 


" d|un|<eé and ( dlv| <e«. 
He He 


Let y be an element of Cx (G) such that supp (y) C K; and y = 1 0n H. 
Set 
Hn = Pin, Y= pv, py, =(1-Y)in, v" =(1—-¢)y. 
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Then |v’"|| < , ||w"|| < €, and the proof is reduced to proving (i) for pi, 
and v’. Furthermore, since ju), converges narrowly to v’, it suffices to show 
that (i) holds when the yz, are supported in a fixed compact set Ky. Let 
K2 be the support of h; then the support of 7,,,h lies in K3 = K, + Ko. 

But K3 is a compact subset of G. Moreover, by the definition of narrow 
convergence, for every fixed g 


/ h(g — g')dpn(g') > i h(g — g’)dv(q’). 


That is, un(g) = (tT, — Th)(g) satisfies un(g) > 0 everywhere. It follows 
from the bound |t,,,h| < cl|h|lc,1K«; that 
[un(g)| S 2cl[Allo, 1x, = C11 Ks. 


Hence, by Lebesgue’s dominated convergence theorem, 


‘; lun(g)Pdg > 0.0 


If f € Co(G), we now determine h € Cx(G) by the condition ||h — fllcy < e. 
As above, we reduce the proof to showing the result when the yn are supported 
in a fixed compact set K3. Setting h(€) = h(—€), we write 


/ KO SuieO= i (rah) (€)dpen(€). 


The mapping ® : g Th from G to Co(G) is continuous. Hence the image under 
® of the compact set K3 is a compact set H C Co(G). By hypothesis II-6.8(iii), 
there exists a constant c such that ||Un||,71 < c. Consider the functions u, defined 
on H by 


eh i Wage e. 


Since ||/4n || ,¢1. < ¢, these functions are equicontinuous. By the definition of narrow 
convergence, 


Gi / y()dv(€), Yy € Co(G). 


Since the functions un are equicontinuous and converge for every y € H , the 
compactness of H implies that they converge uniformly. 0 


1.9.1 Corollary. Let {,} be a sequence of measures which converge nar- 
rowly to 69 and satisfy hypotheses (iii) and (iv) of II-6.8. Then ||tu, f — 
f\lz» > 0. 


1.9.2 Corollary. L?(G) is an M1(G)-module; that is, 


(i) (Tt). ° Tr) (f) = Tyar (f) = (tr 0 Ty)(f). 
In particular, if go € G, 


(it) Tg (Th) = Tul Toof) = T 1x8 9, (f). 
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PROOF. It suffices to verify (i) in the case of discrete measures, where 
everything is obvious; the general case follows from the narrow density of 
Mj(G) in M'(G) combined with Theorem 1.9. 

For (ii), we note that 


(iit) T gq Ut a 8 4 *U, 


where 6,, denotes the Dirac measure at go, and use (i). 0 


2 Spectral Synthesis on T” and R” 


In Section 1 we introduced the Fourier transform, defined on the dual group 
G. We were not concerned with whether the dual group of G contained 
other elements than the trivial character, everywhere equal to 1. If G were 
trivial, Fourier transform theory would have a very limited scope. We now 
exhibit the characters on T” and R” and use them to prove the injectivity 
of the Fourier transform. In certain cases, we will be able to characterize 
its image and give an explicit inversion formula. 


2.1 The character groups of R" and T" 


(i) The characters on R are of the form 
x(x) =e, where tER, t fired. 


Hence R=R. 


PROOF. It is clear that an imaginary exponential satisfies the equation e 
e7e'Y and is a complex number of modulus 1. What must be proved is the 
converse. Let z ++ x(x) be a character of R; then, since y(0) = 1 and a is 
continuous, there exists an interval [—a, a] such that 


it(aty) _ 


Re(x(z)) > 0 if aw € [-a,al. 
Hence we can define a function I(x) without ambiguity by 


log x(x) = il(z), . <I(x) < Z xz € [-a, al. 


Then /(x) is continuous and 
Uat+y)=Uar)+l(y) if xz, y, andxt+y€é [-a,al. 


It follows from this equation that (mx) = ml(x) if m is an integer such that 
|ma| <a, and similarly that 1(4) = Al(y) if |y| < a. Hence I(ra) = ri(a) for 
every rational number r such that |r| < 1. 

By continuity, [(za) = xl(a) if ze € R, |x| < 1. Hence 


tay 


x(y) =e", where a= *1(2) and |y| <a. 
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For any yi € R there exists an integer m such that y: = my with |y| < a; thus 


taym __ Pucral o 


x(y) = (x(y))” =e 


(ii) The characters on R” are of the form 


n 
x(x) = exp (: S- 1) ‘ 
k=1 


where x = (x!,...,2") € R” andt = (ti,...,tn) € R". Hence R" =R". 


PROOF. The imaginary exponentials are obviously characters. It must be shown 
that every character is of this form. Let ex = (0,0,1,0,...) be the kth element 
of the canonical basis of R”. Then \ +> Aex is a homomorphism from R to R” 
and hence \+> x(Aex) is a character on R. By (i), we can write 

ith d 


x(Aex) =e 


Writing c = 5°, x*ex, it follows that 


x(z) = [] x(2*ex) ay I] 2 
k 
(iii) The characters on T” are of the form 


Xm(0) = exp (: S- rt 
k=1 


where m =(mj,...,™Mn) € Z" and (e,..., 8") © T”. Hence T” = 2". 


PROOF. The numbers (6',...,6”) are each defined only up to a multiple of 27; 
this indeterminacy has no effect on the value of xm(0) since m € Z”, and thus 
Xm (0) is indeed a character on T”. 


Conversely, let x be a character on T”. We define (cf. 1.2.5) a homomorphism 
1 


v:R” > T" by setting v(x) = (e* nee er) Then y ov is a character on R” 


and hence, by (ii), is of the form 


x(u(x)) = exp (i by» tux") : 


Suppose that v(x) = 1. Then y(v(x)) = 1; hence 
So tex* = 0 modulo 27. 
k 


Setting x equal successively to 27e1, 27€2,..., 27en shows that ti, t2,...,tn € Z. 
oO 
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2.2 Spectral synthesis on T 
2.2.1 The Poisson kernel 


Given a number r € [0,1), the Poisson kernel on T is the function defined 
by the series 


(i) OLS Peas 


né€Z 


Not only is this series uniformly convergent, but its sum can be calculated: 
+00 +00 
P,(8) = So (re?) + (re?) 
n=0 p=l 


Using the formula for the sum of a geometric series, we obtain 


1 re’? 
EO Tore® * [pe 
Thus 
l-r? 1-r? 
oo P. fa) = = = h = 10 
() (8) 1—2rcos@+r?2 9 (1—r¢€)(1—r€) WHOLE: Se 


2.2.2 Proposition. Let e = 1 denote the identity element of T and let 
dv(¢) denote the Haar measure on T defined in 1.5.4. Then 


P,.(G)dv(6) — 6. narrowly as rl 


and, moreover, satisfies hypotheses (iii) and (iv) of I-6.8. 


PROOF. Let f be a continuous periodic function, with period 27. We must 
show that 


: +7 do 
(i P.(O)fO)s— > f(0) as rad. 
(ii) Note that, by 2.2.1(ii), P.(@) > 0. 
Integrating the uniformly convergent series 2.2.1(i) term by term shows 
+1 de 
that / EO) 5 = 1. Hence, since P,(6) > 0, 
T 


(iii) ||P, llo. = 1. 
(iv) For fixed 7 > 0, maxy<jo)<7 P,(@) = P,(n), which approaches zero as 


r—il. 


Set fi(@) = f(A) — f(0). Since 


+7 fa 
f(0)P(@)5— = f(0), 


at 
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it suffices to show that 


” HOP. Oe — 0. 
— 
Let « > 0 be given. Then there exists 7 such that |f1(@)| < § if |0| < . 
Fixing 7, we split the integral in two: 


+r dé +n do d6 
AOPO =f HORA T+! AOR. 
=a] nN tn\oN|—-7 +7 


The first integral is bounded above by §||P,||z:, which equals § by (iii), 
and the second by ||f lloyT) Pe(n), which approaches zero by (iv). 
Hypothesis (iv) of II-6.8 clearly holds since T is compact, and hypothesis 
(iii) since P,.dv has total mass 1. 0 
(v) Corollary. Let du, denote the measure on M1(T) defined by 
P,(0)d0/2r. 
If f € L?(T) (1 <p < +00), then ||\tu,. f — f\lnz 2 0 as r > 1. 
If f € C(T), then ||tu, f — fllccr) > 0. 
PrRooF. By 2.2.2 and 1.9.1. 
2.2.3 Proposition. Let f € L'(T) and let f(n), n € Z, be its Fourier 
transform. Then 


(f * P,)(0) = s F(m)ri™ e- 


mez 
PROOF. 
P= [Hype 
és [ow bi eeacae de 
=i neZ on 


The uniformly convergent series }7 <7 rile’n(°—¥) can be integrated term 
by term, giving 


(f*P)(0) = So rilein(®) is Oe pe 


new 


So rinlein(® f(—n). 


neZ 


The result follows by setting —n =m. 0 
2.2.4 Spectral synthesis theorem. 
(i) Let f € L?(T), 1 < p< +oo. Set 


— Dy r'"l F(n) eine 


neZ 
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Then ||f — grllze +0 asr — 1. 
(ii) Let f © C(T). Then ||f — grllecr) - 0 as r 1. 


REMARK. Since g, is defined in terms of the Fourier transform of f, the 
theorem shows that f can be reconstructed from its Fourier transform. 


PRooF. By 2.2.2(v) and 2.2.3. 
2.2.5 Theorem on conservation of energy. Let f € L?(T). Then 


(i) liza) = DIF)? and 
neZ 
(iz) | f= ss f(n ei +0 as p>+oo. 
n=—p L2 


Plancherel’s theorem. Let £7(Z) denote the set of sequences such that 
>> lanl? < +00. 


(iii) The mapping f > f defines an isometric isomorphism from L?(T) 
onto ¢7(Z). 


PROOF. Since the characters on T are mutually orthogonal, 


llgrliz2 = Sor FnyP. 
By Bessel’s inequality, 
lifllzz = Do |F@)P 
neZ 


For a proof by contradiction, assume that the inequality is strict. Since 
If — grllz2 > 0 by 2.2.4, |lgrllz2 > Ilfllz2. Hence 


lim ss reli fin)P > >- [f(r 
new 
a contradiction; Bessel’s inequality is in fact an equality and (i) is proved. 
Let V, denote the vector subspace of L? generated by those e’”® for which 
—p <n <p. Then (cf. 1.6.4 and 1.5.3) the orthogonal projection of f onto 
Y, can be written as 
= > finje’. 
In| <p 


By the Pythagorean theorem, 
If — spllz2 + IispliZ2 = Wf lize, 


whence 


If — spll22 = IF ll22 — IispllZ2 = D> Lf)? 


[n|>p 
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where the second equality follows from (i). Since the last expression tends 
to zero, (ii) is proved. _ 

To prove (iii), let U : L?(T) — @(Z) be defined by U(f) = {f(n)}. 
Then U is an isometry by (i). It follows that the image of U is a complete 
subspace of [?(Z) and hence is closed. 

Let W = {{an} € @ : an = 0 except for finitely many n}. The function 
that maps {a,} € W to the trigonometric polynomial $7 a,e7*”® is con- 
tinuous, since the sum is finite. Because the function is continuous, it lies 
in L?; thus U(L?) > W. Since W is dense in ¢? and U(L?) is closed, we 
conclude that U(L?) = @7.0 


2.2.6 The Fourier inversion formula 


If we are given f and want to evaluate the function f at a point, the only 
result at our disposal so far is 2.2.4(ii). The drawback of this formula is 
that it involves a double limit: we must first sum a series, then let r tend 
to 1. 

We would like to obtain results on the convergence of the partial sums 
of the Fourier series of f, that is the sums 


n=+p 


sp(0) = S> Flnje'"?, 


n=—p 


Theorem 2.2.5(ii) is a convergence theorem for the L? norm. 

Lennart Carleson showed in 1965 that the partial sums of the Fourier 
series of a function f in L*(T) converge almost everywhere to f. He thus 
resolved a problem that had remained open for fifty years. The following is 
an elementary result. 


2.2.7 Fourier inversion theorem. Let f € L'(T). Assume 


(i) S-lF(n)| < +00. 
Then 


(it) f(@) = > f(n)e"™® for almost every 0. 


If f is also continuous, equality holds everywhere. 


PROOF. Set 


Gr =f*Pp= d= f(njrinlenin? and (6) = Ss; f(njei”®, 


neZ 


Then y € C(T) since the series converges uniformly. We now show that 


(itt) lle — grllecr) — 0. 
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Clearly 
le — grllecr) ie f(r] - rinly, 


Given e, fix p so that Voinisp |f(n)| < €/2. Then eines lf(n)|( — inl) 
is the sum of 2p + 1 terms, each of which tends to zero. This proves (iii). 

It follows from the inequality ||g-— llr: < |lgr-—¢llecr) that lim,—o ||gr— 
yllz1 = 0. By 2.2.4, 


lf — elle: = 9. 


Thus f and ¢ are equal a.e., and (ii) is proved. 

Suppose that f is continuous; then, since y is continuous, so is f—y = u. 
If u were not identically 0, {u 4 0} would contain an interval, contradicting 
(ii); hence u = 0 everywhere. 0 


(iv) REMARK. As an element of L', f is defined only up to a set of measure 
zero. (ii) means that the equivalence class of f under the relation of equality 
almost everywhere contains a continuous function, namely y. It is reasonable to 
take this continuous function as a representative of the equivalence class of f. 


2.2.8 Density of the trigonometric polynomials 


A finite linear combination of exponentials is called a trigonometric poly- 
nomial. 


Proposition. The trigonometric polynomials are dense in the normed 
spaces L?(T) (1 <p < +00) and C(T). 
PROOF. Since CT) is dense in L? by JJ-3.5, it suffices to prove density in 
C(T), recalling that || |[z» < || llccr- 

Let h € C(T) and let « > 0 be given. Using 2.2.4(ii), fix r such that 
[2h — hrllecr) < §. Decompose h, as 


h, (0) = oe A(n)ritlein? + > h(n)riMe7in?, 


|n|<p |n|>p 
Note that |h(n)| < lAllcacry < llAllecry; this implies 


QrPtl 


l-r’ 


S> h(n)rinlem"?| << |lhllocay 


|n|>p 


Since r is fixed, this expression is less than § for sufficiently large p. Thus 


h- > A(n)r'"y_n|| <€0 


|n|<p 
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Corollary (Injectivity of the Fourier transform on measures). Let 
pu, v € M'(T) satisfy 


p(n) =v(n) if neZ. 
Then p= v. 


Proor. Let Q be a trigonometric polynomial. By linearity, f Qdv = f Qdu. 
Since the trigonometric polynomials are dense in C(T), it follows that 


[tava [ tau Vf €C(T).0 


Corollary (Uniqueness of Haar measure on T). Let p be a Haar 


measure on T. Then there exists a constant c such that p= ci. 


Proor. By 1.5.2, p(n) = 0 if n 4 0. It thus suffices to use the preceding 
corollary. 0 


2.8 Extension of the results to T” 
The Poisson kernel is defined on T” by 


P.(¢) = [] PC"), C= (",-.-.0") © T”. 
k=1 


Since the Haar measure dv(¢) = dv(¢) ®---@dv(¢") is a product measure, 
P,(G)dv(¢) = Pp(¢")dv(¢*) @--- @ P,(¢")dv(C"). 


By 2.2.2 each term converges narrowly to é¢,; hence P,(¢)dv(¢) converges 
narrowly to é¢. 
It can be shown as in 2.2.3 that, for all f € L1(T”), 


2.3.1 (f * P,)(@) = oS Flm)riimll etm 9 
mez” 
where ||| = |. | + |mo| +--+ |mn| and m.6 = R_, mpd". 


The following theorems are proved as in 2.2. 
2.3.2 Spectral synthesis theorem. Let f € L?(T) (1 < p < +00). Set 
Gr (9) = So f(m)riirlle“*™-8, Then 

If — Grllzecrny) — 0. 


If f € C(T), then 
lf — grllecr) — 9. 
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2.3.3 Theorem on conservation of energy. Let f € L?(T"). Then 


(i) Ifllz2 = 3 Lfemp. 


mez” 


Set 8,(9) = dimes, f(m)ew?™, where Sy, = {me Z" : |mz| < p Vk}. 
Then 


(zi) If — Spllzz¢r") 70 as p— +00. 


(iit) (Plancherel) The mapping f > f is a bijection of L?(T") onto £7(Z"). 
2.3.4 Fourier inversion theorem. Let f € L'!(T"). Suppose that 


(i) = |f(m)| < +00. 
mez" 
Then 
(a) f(0) = S f(mjen?”? for almost every 6. 
mez” 


(iit) If f is continuous, equality holds everywhere. 


2.4 Spectral synthesis on R 
2.4.0 Regularity of the Fourier transform on R” 


Let pp € M1(R"). Its Fourier transform is defined by 
Att) = [et dy(e) 


2.4.0.1 Proposition. The Fourier transform ji(t) is a bounded continuous 
function and 


(¢) WHllc,an™) S Wellan- 


PROOF. Set ps = |u| with k € L),. Then 


Attn) = f et e(x)d\l(a). 


If the sequence {t,,} converges to tg, the sequence of functions {fe *k(xr)} 
converges everywhere to e’-*k(x). Since it is bounded in modulus by 


le Liat Lebesgue’s dominated convergence theorem implies that fi(tn) — 


ii(to). Finally, 


ate) sf all = |wiR") = allan. 
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2.4.0.2 Theorem (Lebesgue). /f f € L1(R"), then its Fourier transform 


f(t) = San ef (x)dx is a continuous function that vanishes at infinity, 
and 


(7) filcoarny < If lla. 


PRooF. Since f(x)dz € M'(R”), the only new property to be proved is 
that 2: 
f(t) 0 as |ltl| + +00. 


Let « > 0 be given. Since the translation operator is continuous on 
L'(R"), there exists 7 such that 


(it) Ityf —filn<e if, |lyll <n. 


It follows from the property Ty f(t) = et f(t) that (GF = fee) = 


nw 


(eur = 1 F(t). 
Using (i) and (ii), 


(iit) le DFO <e if llyll <n. 


If t satisfies ||t|| > m~!, we can find y such that y.t = 7 and |ly|| <n. 
Hence, by (iii), 
2\f()|<e if |l¢]) > an7t.o 


2.4.1 Dilations and the Fourier transform 

A dilation on R is multiplication by a positive number A: 
rredAxr Vere R, 2 fixed, A> 0. 

Given a function u defined on R, let 

(2) uy(z) = A7*u(A7!z). 


Take u € L'(R) and set \~!x = y. Then fuy(x)dx = f u(y)dy. In partic- 
ular, 


(12) Iualler = llullze. 

Similarly, again setting \~!x = y, 

(tit) ty(t) = Jawetan = [ueay = u(t). 

2.4.2 Lemma. Let u € L'(R) and assume that fu(x)dx = 1. Then, as 


A— 0, uy(x)dx converges narrowly to the Dirac measure at 0 and satisfies 
hypotheses (iti) and (iv) of Theorem II-6.8. 
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ProorF. Let f € C,(R) and set fi(a) = f(x) — f(0). Then 
[ weoriteiae = (0) fuste)de + f falo)ua(eae, 


Since the first integral on the right-hand side equals 1, it suffices to show 
that the second tends to zero. Setting A~!x = y, we can write this integral 
as f fi(Ay)u(y)dy. Fix A so that Jiyio-a u(y)dy < {fille - Then 


: , ue | 
[a y)ju(y)dy | cs ee 
ee Jalen fimo < 5° and 


les 


Since A is fixed, AA — 0 as A > 0. Since f\(0) = 0 and f; is continuous, 
the last expression will be less than § for \ sufficiently small. 0 


IA 


IA 


maxy}<y4 |fi(t)| llulles- 


2.4.3 Proposition. For every > 0, 


: tu 1 aa ita 
0 ool £8) =e howl $) oe 


Proor. Cf. IV-4.3.2(ii), where this formula is proved for = 1. The general 
case is obtained by applying 2.4.1 (iii). 


2.4.4 Proposition. Set 


1 x 
Gale) = Gaye °F, ): 
Then, as 1 > 0, G,(x)dx satisfies the conclusions of 2.4.2. 


ProoF. It follows from 2.4.3(i), with t = 0 and yw = 1, that 


1 x 


It now suffices to apply 2.4.2. 0 
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2.4.5 Spectral synthesis theorem. Let f € L1(R), let a be its Fourier 
transform, and set 


ae 2 Na 
(i) ala) = f e**Fitvex (-) F. 
If fe Li nL? (1<p< +ov), then 


(it) lf -gullze 0 as pO. 


REMARK. We must assume that f € L1, since otherwise the integral sae 
ing the Fourier transform f does not converge. Moreover, since lf fil LoS 
||fllz2, this assumption implies the convergence of the integral defining g,,. 


PROOF. By 2.4.2 and 1.9, 
(iit) \|\f *Gu — fllze — 0. 
Furthermore, since G, is an even function, 
+6.) = f Gulu ays 
An integral expression for G,,(x) is obtained by interchanging t and zg, writing 


pw! for yw, and multiplying by CHENIES in 2.4.3(i). Substituting this into the 
integral above yields 


(f * Gu)(a) = [tw if exo (—"F) eon di: 


The hypothesis f € L’ implies the convergence of the double integral 


2 
ff eo(-E) tat av ae 
R2 


Hence Fubini’s theorem can be applied; reversing the order of integration gives 


= Pp —itxz ity dt 
+(e) = f exn(—SB) em | f ervey] 


Recognizing the quantity in brackets as f(t), we have shown that 


(iv) (f * G,)(2) =f f(t) exo (-“) evite aE Vf EL. 
R 2 Qn 


Now (iii) and (iv) imply (ii). 0 


2.4.6 Fourier inversion theorem. Let f € L!(R). Suppose that 


(i) feL(R). 
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(ti) f(a) = | ete fys for almost every x. 
R 


(itt) If f is also continuous, equality holds everywhere in (ii). 


PROOF. Let g, be defined as in 2.4.5(i). Then, as w — O, the integrand in 
2.4.5(i) tends everywhere to e~"” f(t). Furthermore, it is dominated by the func- 
tion |f(t)| € Lt. By Lebesgue’s dominated convergence theorem, 


—ita F dt 

; i = : 

ate) — fe“ FOE = ote) 
Next, since || f — Gull,, 9, we can extract a subsequence jx such that 

f(x) =limg,,(x) almost everywhere. 
This implies (ii). 

To prove (iii), note that y(x) is continuous by 2.4.0.2. Thus v(x) — f(x) = u(x) 
is continuous. By the same reasoning as in 2.2.7, u(x) = 0 ae. > u(x) = 0. 


In the next section, we will study the space of those functions f to which the 
Fourier inversion formula applies. 


2.4.7 The Wiener algebra A(R) 


Let 
A(R) = {f € L'(R): f € L'(R)}. 


It follows from 2.4.6(ii) that the equivalence class (for equality almost ev- 
erywhere) of every f in A(R) contains a continuous function. From now 
on, we will take this function as the representative of f. Thus the Wiener 
algebra is contained in the Banach space of continuous functions. 

The Fourier inversion formula can be applied to f if and only if f € A(R). 


We set |Ifll aaa) = lf lla: + Wflle- 
(i) f € A(R) is equivalent to f € A(R). 
PROOF. By the Fourier inversion theorem, 


fea) f ores. 


Set f(—x) = u(x). Then u(x) = f f(t)e"” #4; that is, 


nn 


(iz) u=(f)”. 


Hence As 
(fy elieuveLlefet. 


(wii) If f € A(R), then f € Co(R) and |If\lo,aR) S Ifllaaay-: 
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PROOF. By the inversion formula and 2.4.0.1. 


(iv) If f € A(R), then fe LP Vp l<p<-+oo. 
PROOF. / [fae < Ife Ul Fllee- 


(v) If f, h € A(R), then f xh © A(R). 


n~ 


PROOF. ||f «Allg: < |IfllzilAllza and (f * h)* = f h, whence 


Wf *h) es =F Alles < filo Wales < WfllealiAll ca. 
Thus f xh € A(R). 
(vi) Let f, he A(R). Then (fh). =f xh and fhe A(R). 
PROOF. By (ii), 
(fh)(—2) = (F*h)*(2). 


By (i) and (v), fxhe A(R). The inversion formula can be applied, and 


[umene E = Fine. 


Hence, replacing x by —z, we see that (fh)* = fxhe A(R); by (i), fh € A(R). 
(vii) A(R) is dense in L?, 1 <p < +00. A(R) is dense in Co(R). 
PROOF. Let L%, denote the L” functions which are zero a.e. outside a compact 
set. Then L% is dense in L?. Let h € L¥.. Set hn = h* G,,-1, where G, was 
defined in 2.4.4; then ||hy — Al|ze — 0. 


We now show that h, € A(R). Let K be a compact set such that h(x) = 0 
a.e. on Kk“. By Hélder’s inequality, 


1/q 
Ihllas < i ae lllae, 
K 


where p and q are conjugate exponents. Thus h € L' and hy € L. Moreover, 
~ ~ v? e? 
|Rn(t)| = ew en(-5) S |[hllcs oo(-5) ; 

whence Rn € L' and An € A(R). 


If h € Cx(R), then hn =h*G,-1 € Co(R), hn € A(R), and ||h —Rnllcy = 0. 
oO 


132 III. Fourier Analysis 


2.4.8 Theorem on conservation of energy. Let f € L'N L?. Then 
IIfllz2 = (20) Wf lizz- 
PROOF. Let f € L'NL?. Set fn = f *G,-1. Then 
A t? \ x 
(i) fn€L' and fn(t) =exp (-5) f(t). 
The Fourier inversion theorem can be applied to fn, giving 
es - —itx dt 
te) = ff ie 


Replace fn(x) by this expression in the scalar product: 


(adus = f sortat@yas = fey] | aloe | ae 
R R R 


Since f € L' and fe € L', the double integral converges and, applying Fubini’s 
theorem, we can reverse the order of integration: 


e itr dt _ - ey dt 
Uda = [ BO] f teeta] £= f Ronad. 


Let n — 00; then, by 2.4.5, || fn — f||;2 — 0, and the left-hand side thus tends 
to || f||Z2. Using (i) on the right-hand side, we obtain 


jim, [Lo e(-£) a = If lize. 


The sequence {ex(- e) \ is increasing. Applying the theorem of Fatou-Beppo 


Levi shows that |f(t)|? is integrable and that 


/ For = Wile 


Qn 


2.4.9 Plancherel’s extension theorem. The Fourier transform has an 
extension 


(2) U: L?(R) > L?(R). 


(ii) (2r)~2U is an isometric mapping of L?(R) > L?(R). 
(iit) U is a continuous bijection of L?(R) > L?(R). 
(iv) The inverse of U is given by 


U-(h) U(nh). 


il 
~~ On 
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Proor. Consider the mapping u: f > f, from V = L!M L? to L?. Then, 
by 2.4.8, 


(v) lluwliz2 = 2n|lulliz, Wo eV. 


Hence wu is a uniformly continuous mapping into the complete space L?. 
It thus has an extension to the closure of L! M L? in L?, which is just L?. 
Moreover, 2.4.8 extends by continuity and gives (ii). In particular, U is 
injective. It remains to prove (iii) and (iv). By 2.4.7(iv), 


A(R) c L'(R)NL?(R). 
Hence, by 2.4.7(i), 
U(L' nL?) > U(A(R)) = A(R) = A(R), whence 


(vi) U(L! NL?) is dense in L? by 2.4.7(vii). 


Next, since (27)~!U is an isometry, the image of L? is a complete, hence 
closed, subspace of L*. Thus (vi) implies that U is surjective. Finally, the 
inverse mapping of U is, up to a factor of 27, an isometry. It follows from 
(v) that it is determined by its restriction to A(R). The restriction is given 
by the Fourier inversion formula, and can be written as 


a eee of ee, Sete 
=— ‘ t)dt = — itz f(t\dt = — : 
fay= x f ee fna= = f ee Fna = Lue 
This expression for U~! on a dense set is valid everywhere, since U~! is 


continuous. 0 


(vii) REMARK. What is striking in Plancherel’s theorem is that it gives an 
isomorphism of spaces. Thus a problem posed in L? is equivalent under the 
Fourier transform to another problem posed in L?. 


2.5 Spectral synthesis on R” 


We now generalize the results of the last section to R”. Let 


1 1 
Gale) = Gag ew(-sEllel?) 


where ||z||? = (x!)? +... + (x")?. Then G(r) = []p_, G(x"). 
By (2.4.4), G,,(a*)dx* converges narrowly in M!(R) to the Dirac mea- 
sure at zero. When pu — 0, uz > 0, we find that 


G,,(x)dxz = @G,,(a*)dx* 
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converges narrowly to the Dirac measure at zero in M!(R"). Moreover, by 


(2.4.3), 
ty ullell? ite 
Gis Gay iD exp( ale dt. 


Spectral synthesis theorem. Let f € L'(R") and set 


| 7 2 
et? F(t) exp (-“") Gay : 


Gu(@) = f x Gy (x) = if 


R”™ 
Then ||f — gullz1 — 0. If, in addition, f € L' ML? (1 < p < +00), then 
If < Gull i» = 0. 
Fourier inversion theorem. Let 

A(R") ={f € L'(R"): fe L'(R")}. 


Then A(R") is dense in L?(R"), 1 <p < +00, and in Cy(R”). 
Furthermore, almost everywhere in x (with equality everywhere if f is con- 
tinuous). 


oe sare, DOL 4 
fay= fi fine 2. wpe AR"). 


Plancherel’s extension theorem. There exists a bijective mapping U of 
L?(R") onto L?(R") such that 


IOAllze = 27)" |fllzz and U(f)=f. VPEL nL. 


Moreover. 
1 aa 
Uh) = U(h). 
(h) = GU) 
The proofs of these results are identical to those already given for the 
case where n = 1. We end this section with a new result. 


2.6 Parseval’s lemma. Let f € A(R”) and let p € M'(R"). Then 


f(a)du(x) = Ff (t)ji(—t)at. 


R” (27) Rr 


PROOF. The Fourier inversion theorem, 
fle) = ame ft Fle rat 
e (27)” R ; 


can be used to write f as a function of f on the left-hand side of the 
assertion of the lemma. Since f € L', Fubini’s theorem can be applied to 
the resulting double integral. We obtain 


a f(a)du(a) = I. f(t)dt ( is : edule) x apr? 
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Corollary. Let y, ’ € M1(R") such that f(t) = f(t). Then w= p'. 
PrRooF. For all f € A(R”), 


| fle)du(e) = ie f(e)du! (a). 
R” R” 


Since A(R”) is dense in Co(R"), pw! = p. O 


3 Vector Differentiation and Sobolev Spaces 


3.1 Differentiation in the vector sense. The spaces WP 


The goal of this section is to interpret the notion of derivative in terms of 
translation operators. The advantage of this point of view is that, since the 
Fourier transform realizes the spectral analysis of translation operators, the 
same will be true for differentiation operators. 

Given f € L?(R") and a € R”, we say that the derivative of f in 
the direction of a exists in the L? sense and equals Daf if, when e — 0, 
lime7!(tea f — f) exists in L?(R"”) and equals —Daf. 

We then have 

Daf + eal =f) lbp 0: 


Let 
W? ={f € L*(R") : Daf exists in the L? sense for every a € R"}. 


Decomposing a = ate, +... +a@”en with respect to the canonical basis of 
R", we write Daf = Sla*D., f if f € W?. Given an integer s > 1, we 
define 

WP={few?: Daf ewer, Vf eER"}. 
If f © WP, Da, Da..--Da,f is defined recursively. 


3.1.1 Theorem (Spectral analysis of differentiation operators). Let 
feWi. Then 


Daf (t) = —i(a.t) f(t). 


Proor. D,f € L', and hence Daf is well defined. Since the convergence 
occurs in L!, the order of integration in the following expression can be 
reversed: 


Daft) = y lim €'(reaf ~ f)(w)e!* da = lime" (Tea ~ F(t). 
By 1.7.2, 


(eet < 1) 
€ 


—Daf(t) = lim F(t) =i(at)f(é).0 
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3.1.2 Corollary. If f ¢ W2, then 
(Da, Daz» Da, f)*(t) = T] (—ilax-t)) fo) 


3.1.3 Theorem. /f f € W}, then 
F(t) = o(llél*) as It|] > 20. 
Proor. Ds € L’. By 2.4.0.2, Ds f tends to zero at infinity. Hence |a-t|5 f(t) 
tends to zero at infinity, and this is true for every fixed a. 0 
3.1.4 Corollary. W},, CA={feL!: fe L'}. 
Proor. Since f(t) = o(|t||-"-!) and f € Co, it follows that fe L1. 0 


3.1.5 Proposition. Let 4. € M'(R") be a finite measure and let f € WP 
(where 1 < p< +00). Then tf € W? and 


Da(tuf) = Tu (Daf). 
PROOF. Tati f = TuTaf and «7 "(tea — 1) tu f = Tyle7 (Tea — DF. 


Since 7,, is a bounded operator on L?, the convergence of the right-hand 
side implies the convergence of the left-hand side. 0 


3.2 The space D(R") 


3.2.0 Definition. Let D(R”) denote the space of infinitely differentiable 
functions on R” with compact support. We show that D(R”) contains 
functions that are not identically zero. Let 


Fe). = exp(=t) if O<r<l 
= 0 if r<l. 
Set 
(7) F(x) = f(\lel|?), where |[x|? = (2)? +... + (@")*. 


Then F is infinitely differentiable. Since F> 0 on R” and F>0Qona 
nonempty open set, f F(x)dx > 0. Let F(x) = aF (x), where the constant 
a is determined so that f F(x)dz = 1. Then, setting 
(it) Fy (ay =A “FOC ts), 


it follows from 2.4.2 that F\(x)dx — 69 narrowly. 
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3.2.1 Proposition. If y € D, then py © WP (1 < p < +00) for every 
positive integer s. In particular, 3.1.2 holds. Furthermore, 


Op 
= k 
Dep Doe ae 


PROOF. We use Taylor’s formula with integral remainder: 
: Op Op 
-1 = be Soper 
[-e7 "(Tea — 1) + Day|(z) = [ : Gk (3% (x — aC) Dak @)) dc. 


The right-hand side tends to zero uniformly in 2 when « — 0. As its 
support lies inside a fixed compact set, we obtain convergence in all L? 
(l<p<+too).0 

3.2.2 Corollary. If f € L? andy €D, then f*p € WP for every integer 
s>0. 

PROOF. €7!(Teq — I)(f * y) = f * (Tea — Dew. 

The last term on the right-hand side converges in L’ by 3.2.1 applied to ¢, 
withp=1,s=1.0 


3.2.3 Proposition. Let 1 € M! and assume that yp has compact support. 
Then (tpy) € D for every p € D. 


Proor. Let K, be the support of yz and let K2 be the support of y. Then 
the support of 7,,y lies in the compact set Ky + Ko. 
Moreover, 


(ry9)(a) = a ple — y)du(y). 


1 


Differentiating with respect to z! under the integral sign is legitimate since 
24 is continuous and the integral is taken on a compact set. Hence 


ts) Oy 
Bui (THY) ST (55) oO 
3.2.4 Proposition. The space D is dense in L? (1 < p< +00). 


PRoor. Let f € L?. Using the truncation operator, we see that there exists 
f € L” such that f is zero outside a compact set and 


If —fllue <e. 
Set f* Fy = uy. Then, by 3.2.0(ii), 
ju, —filn» 20 as AO. 


Since f e LP and_f has compact support, it follows a fortiori that re Li. 
Hence, by 3.2.3, fx Fy € D. 0 
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3.3 Weak differentiation 


3.3.1 Definition. We denote by Lj... the functions which are integrable on 
every compact set. Given f € Lj,,., the Radon measure f(x)dz is called the 
measure associated with f. 

f € Lj,. is said to have a derivative in the direction of the vector a in 
the weak sense, or a weak derivative, if there exists u, € L}.. such that 


[fue = = f wav, Vp € D. 


The reader familiar with the distribution theory of Laurent Schwartz will 
recognize a special case of differentiation in the sense of distributions. 


loc 


3.3.2 Theorem. Let f € L”. Then the following statements are equivalent: 


(i) fEew?. 
(ii) For everya € R", Daf exists in the weak sense and D,f € L?. 


PROOF. (i) = (ii). The identity [(7af)h = f f(7-ah) implies 


(iii) fie (Tea ~1)f)h = [itch - hye! WKELP,hEL*. 


Writing (iii) with h = y, we can pass to the limit on the left-hand side 
since y € D C L*, and on the right-hand side since y € W/ by 3.2.1. This 
yields the formula for integration by parts: 


| Pofe=-f Oe) WF e wh, ger. 


Hence ug = Daf, and (iii) follows since D,f € L?. 
The proof that (ii) = (i) uses the following version of Taylor’s formula 
with integral remainder. 


3.3.3 Lemma. Let f € Lj, and suppose that f has a weak derivative in 
the direction of a, say uq. Let p. be the Radon measure defined by 


(9: Pe) = [ a-cava, Vg € C,(R"). 


Then 
—e lreaf - f | = Tpttg- 


PRooF. Let y € D. Using formula 3.3.2(iii), Taylor’s formula with integral 
remainder for y, and Fubini’s theorem, we have 


iemlerres _ fly = [ trea aa y)e! 
= Be Soja | Sat arte + Eads 
fel Sat Sele + ea) fade 
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Since f is weakly differentiable in the direction of a, 
Lak Pele teas ade = - f (r-gav)(e)uala)der 
= | 2) (reatte)(a)ae, 


whence 
[terat — fl +7,ua}(x)p(x)dz = 0, Ve ED. 


As we saw in 3.2.4, D is dense in L; this implies that the quantity in braces 
is the zero function of L?: 


(*) —e"Tteaf = f) a Tp. Ua UO 


3.3.4 Proof that (ii) = (i) in Theorem 3.3.2 


The result follows from considering the limit of the right-hand side of (*) 
and using 1.9.1. 0 


3.3.5 Corollary. Let {e1,...,€n} be a basis for R" and let {fn} be a 
sequence of functions in WP such that ||fn — f|lLp — 0 and, for all k, 
De, fn converges in L?. Then f € WP and, for any a in R”, 


|Paf 3 Dafn\|ie — 0. 
PROOF. It suffices to prove that f is weakly differentiable in the direction 


of a. The hypotheses allow us to write 


/ ipios / Di fie WOED. 


Since f, and De, fn converge in L? and since D,,y and ¢ are in L’, we 
can pass to the limit in this equation, obtaining 


[ tPae=- f elimDa tn) 
That is, f is weakly differentiable in the direction of e, and its weak deriva- 
tive is 
lim(De, fn) € L”. 


Let a € R”, say a = Yoa®ex. Then Da fn = D> a* De, fn, and hence f has a 
weak derivative in the direction of a which is equal to > a* lim De, fr. By 
Theorem 3.3.2, f € W? and Daf = dl a* lim De, fy = lim Daf. O 


3.3.6 Corollary. Let W? be given the norm 


fllwe =Ulfllce + 32 Dex filer, 
k 
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where {€1,...,@n} is the canonical basis of R". Then WP is a complete 
normed vector space and D, is a continuous mapping from WP into L?. 


PROOF. The only statement that is not obvious is that W? is complete. If 
{fn} is a Cauchy sequence in the WP norm, then both {f,} and {De, fn} 
are Cauchy sequences in the L? norm. 

Since L? is complete, f, converges to some f € L?. Moreover, f € W? 
by 3.3.5. By definition, 


Ilfn — fllwe = Wf — file + 90 [Deg fn — De, fllze. 
k 
Since ||De, fn — De, f\lL» — 0 by 3.3.5, f, converges to f in W?. O 


3.4 Action of D on WP. The space W?,,. 


3.4.1 Proposition. Let ~ € D and let the operation of multiplication by 
~, written my, be defined by (my f)(x) = p(x) f(x). Then 


my: WP > WP for every p € [1, +00] and for every integer s. 
PROOF. We prove the proposition when s = 1. First we show that 
(z) Dalef) = (Dav) f + eDaf. 


This formula is proved by passing to derivatives in the weak sense. Let Da denote 
the weak derivatives. Then 


/ DAghIS= i: pi(Da(v)) HED. 


Furthermore, by Leibnitz’s formula for continuously differentiable functions, 
—pDa(h) = ¥Da() — Da(yy), whence 


i Dep i: fDa(yv) + / fuD.() 


i 


/ Dal fyb + / wf Dal(¢). 


Let 2 7 

G = [Da(ef) — pDa(f) — fDaly)]. 
Then G is orthogonal to every w € D. Since D is dense in L’ if g < +00, it 
follows that G is zero. If p = 1, the fact that G = 0 follows from the density of 
D in Co(R”). Thus (i) is proved for weak derivatives: 


Da( fe) = vDaf + fDa(9). 


Since y and Day are in L™, the right-hand side is in L? if f € WP. Theorem 
3.3.2 then gives the result. 0 
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3.4.2 Differentiable partitions of unity 


Theorem. Let U, be an open cover of an open subset O of R". Then there 
exists a partition of unity pn such that 


O<¢n <1, 
Yn € D(R"), 
supp(Yn) C Vain) EO, 


and 
S- ¢n(z) =1, VreO. 


The series is locally finite; that is, for every compact subset K of O, 
supp (~n) NK = except for a finite number of indices. 


ProoF. Let 
: 1 
Kn = {2 € O: dist(x, O°) > 5 and ||z|| < n} : 


Then each K,, is a compact set contained in O, and the union of all the 
K,, equals O. By Theorem II-1.4.1 we can find a partition of unity with 
continuous functions f,. We may also assume that U, is a locally finite 
cover. Set 


2€, = dist(supp (fr), Un): 
Let Wn = F.,, * fn, where F, was defined in 3.2.0(ii). Then 
supp (Wn) C supp (fr) + B(O, €n) C Vain): 


By 3.2.3, Wn, € D since F.,, € D. 
Next, writing out the integral expression for wn, 


[tole — y) Fe, (y)dy = vn(2), 
we see that wn(x) > 0 whenever f,(x) > 0. Hence 


S> Un(2) > 0 for every z € O. 


Set 


r(z) = S> n(x). 


Then r~! is an infinitely differentiable function and yn, = r~ ‘Wn, satisfies 
the conditions of the theorem. 0 
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3.4.3 The spaces W?,,. 


(i) Let O be an open set in R”. We denote by D(O) the infinitely differ- 
entiable functions defined in O which have compact support. A function yp 
in D(O) can be extended to R” by setting y(x) = 0 for x ¢ O. 

Writing ~ for the extension of » to R", we note that ¢ is infinitely 
differentiable: given a point x) on the boundary of O, there exists an open 
neighborhood V of xo in R” which does not meet the support of y. Hence 


y vanishes identically in V and is therefore infinitely differentiable. Thus 
(v1) D(O) = {vg € D(R") : supp (y) C O}. 

We define 
W, 


s.loc 


(O) = {f defined and measurable on O: 
fe © WP(R”) for any yp € D(O)}. 


(iii) Proposition. f €¢ W?,,.(O) if and only if for every xy) € O there 


exists an open neighborhood V,, of xo in O such that 
pfEeW?(R") Vee D(R") with supp(y) C Va,. 


ProoF. The forward implication is trivial. The reverse implication is proved 
by using a partition of unity subordinate to the cover {V,, }, where x € O. 
Oo 


3.5 Sobolev spaces 


We now study the spaces W?. Since W? is a subspace of L? for every s, 
Plancherel’s theorem allows us to characterize its image under the Fourier 
transform. The space W? is written H® and called the Sobolev space of 
order s. The isomorphism of L?(R") onto L?(R") defined by Plancherel’s 
extension of the Fourier transform in 2.5 is denoted by F. 


3.5.1 Theorem. Let f € L?(R") and let h = F(f) be its Fourier-Plan- 
cherel transform. Then the following two statements are equivalent: 


(2) fC. 
wy | Jat) 2(1 + |\tl|2)°dt < +00. 
an 


PROOF. Restricting to the case where s = 1, we first show that (i) > (ii). 
For f € H', we have the following extension of Theorem 3.1.1: 


(iit) F (De, f) = —itrF(f)(t). 
To prove this, note that F(t., f) = e’* F(f) and 


OU F ag ae ea ng). 
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Since the left-hand side converges in L? to —F (De, f), the right-hand side 
also converges in L?. Passing to a subsequence €;,, convergence in L? implies 
convergence a.e.; (iii) follows since the right-hand side converges everywhere 
to (it,)h(t). Hence 


f © H® She L? and (t,h(t)) € L’, 


and therefore 
JAC) PFA +t+...+¢2) EL. 
We now prove that (ii) = (i). Let y € D; then, by Plancherel, 


[ 8 = oe [FFD 


By (iii) (or 3.1.1), F(De,¢)(t) = —ith.F(y)(t), whence 


1 : 
/ (Dab = Gn i (it, F(f) (QF ey at. 


By (ii), teF(f)(t) € L?. The inverse Plancherel isomorphism F~* can now 
be used to show that there exists a function u, € L? such that F(u,)(t) = 
—ith(F f)(t). Thus 


jf tPae= fae: 


that is, the weak derivative of f in the direction ey is the function ux, € L?. 
Theorem 3.3.2 shows that f € W? = H'.0 


3.5.2 Definition of H® for s not an integer 


Let s be a positive real number that is not an integer. Set 


wea {pents [a+ ine ier < too}. 


We define a norm on H* by 
(i) I fll3r6 = | (1+ tll?) (FEN) Pat. 
R” 


For s = 1, this norm is different from the W? norm introduced earlier, 
but the two are equivalent. The advantage of the present norm is that H* 
becomes a Hilbert space with scalar product 


(filfe)as =] (hiho)(t)(1 + |lel|*)°dt, where hy = F(fe), k= 1,2. 
R”™ 
3.5.3 Proposition. Let f ¢ H°. Then 
(2) Tuf € H* for every measure us € M}. 


PROOF. F(t, f)(t) = u(t)F(f)(t). Hence, since |”(t)| < |||] ,21, 3-5.2(i) implies 
that 
tu f laze < |lellarr fle < +00.0 
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3.5.4 Differential characterization of H® 


Proposition. Let f € L?(R") and let 0 < s <1. Then the following two 
statements are equivalent: 


(i) f © H°(R"). : 
(wt) I5(f) = Te Ite f —f llZ2 pee” < +00, wheren = dim(E). 


PROOF. We use the Fourier-Plancherel isomorphism. Let u = F(f). Then 


dx ps Je 
Iw) = fo cope fei - Pale Pas, 


Next, we set 


: ; dx 
ME =| eee in 
ane ee Ter 


This integral is invariant under the mapping x — A.zx, where A is an 
orthogonal matrix. Hence A(‘A.€) = A(€); that is, there exists a function 
w:R* — R* such that \(€) = W(||€l]). 

Note that, under the dilation € +> a€ (a > 0), 


2 ae 
_ —ie.a& 442 
wealiel) = fh leire’ — Pe 


Setting ax = y gives 
—t a“"d n+2s s 
walle) = fh len’ —1P Pam = wall) = aoe. 


Setting ||€|| = 1, this shows that w(a) = a?%y(1). Hence A(E) = e|lE||?°, 
where c is a strictly positive constant. Finally, 


in(u) =e J ll)Plel a. 


Since f is assumed to be an L? function, [ |u(€)|?dé < +00. Hence the 
finiteness of [,(u) is equivalent to that of 


[mora + herr. 
ie 


Corollary. Let f € L?(E), where s is a positive real number. Let s be 
decomposed as s = p+ s’, with 0 < s’ < 1 and p an integer. Then the 
following statements (itt) and (iv) are equivalent: 


(iii) f € A(R”). 
(iv) (DED... DE" f) © H®, Ym such that |m| < p. 
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REMARK. If s’ = 0, then H* = L? and this is the definition of H® for 
integer s given in 3.1. 

If s' > 0, then 0 < s’ < 1 and membership in H s’ is characterized by 
convergence of the integral (ii). 


PROOF. Set Ff = u; then (iii) becomes 
EPer? £7" (1+ Ell)” Jue L?, Ym such that |m| < p. 
This is equivalent to 
(1+ |l€l|)°u € L?.0 
3.5.5 Operator of multiplication by a differentiable function 
Proposition. Let p € D(R”) and let f ¢ H*. Then yf € H*. 


PROOF. The result was proved for integer n in 3.4.1. Using 3.5.4(iii) reduces 
the proof to the case where 0 < s’ < 1. 


We begin by writing 


(i) ta(vf) — of = oltef — f) + Te(pf) — ote(f). 
Then, since y is bounded, 
(it) e(t2f — f\iln2 < |lellze Ite f — filez- 


Set ¢— ao = y; then J, |r2o (vf) — vteofldx = f lef — (729%) fldy. Thus 


IIte(Vf) — oreo (Allz2 = I(t-2oe — 9) fllzz < If llzzll7-20% — vlli~- 


By the mean value theorem, 
(iit) I|7-20P — lla < Clleoll”. 


Substituting inequalities (ii) and (iii) into (i), we obtain the integral convergence 
criterion 3.5.4(ii). 


3.5.6 The spaces Hj,.(O) 
Let O be an open set in R”. We say that f € L?,,(O) if flx € L?(R”) 


loc 
for every compact subset K of R”. For s > 0, we say that f € Hj,(O) if 
yf € H*(R") Ve € D(O). The next proposition follows essentially from 


3.5.5. 


Proposition. Let f € L?,,(O) and suppose that, for every xo € O, there 
exists a function y € D(O) such that p(ro) #0 and yf € H*(R"). Then 
f € Hy,.(O). 


PROOF. Let v € D(O) be such that v = 1 on a neighborhood of zo; assume that 
its support supp (v) is small enough that y(x) 4 0 on supp (v). Multiplying by 
aar(2), we obtain 


Vao € O 30 € D(O) such that 0~ € H*(R”) and 9 = 1 on a neighborhood of zo. 
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Let Uz, = interior of @~'(1). Then, for xo € O, the collection {U;,} is an open 
cover of O. Let y1,...,¥n € D(O) be a partition of unity subordinate to this 
cover. Then xn f = xXn6@f, where @ corresponds to the open set U containing 
supp (Xn). By 3.5.5, xn(@f) € H*; that is, yn f € H* for every s. 

Let y € D(O). Then the identity f = }\ xnf gives yf = SY) exnf. This sum 
is finite and all the terms are in H*; hence yf € H°. 


3.5.7 Invariance under diffeomorphism 


Theorem. Let O be an open set in R” and let g be an infinitely differ- 
entiable diffeomorphism from O onto an open set O. If f € Hj,.(O), then 
(fog) € Hj,.(O). 

PROOF. We use the criterion in 3.5.6. If s is an integer, it suffices to compute 
the derivatives of the composite function wo g (where w € D(O)) and to use the 
characterization of H® by means of weak derivatives. 


By using 3.5.4, we may assume that 0 < s < 1 and that f and f have compact 
support. Then the integral 3.5.4(ii) becomes 


a | Fal: ay 2__dx 
Li= f lflate +) — Town aay 


Consider the mapping of x defined by 


Py(x) = g(z + y) — gly). 
Then p is a diffeomorphism for fixed y. Let 


n= wy a Flaty) + plo) - Flow eae. 


Setting g(y) = y and pg(x) = z gives dy = (det g~')(y)dy and dx = det(p, ')dz. 
By the change-of-variables formula for multiple integrals, 


Lf) = i dyldet fires 2) ~ FP aet vy A < 400, 


Since g is a diffeomorphism and all that matters is its restriction to a compact 
set, there exists a constant c; such that ||z|| > ci||z||. Similarly, there exists an 
upper bound cz for the functional determinants, and 


Lf) <¢ | dy ‘| F@+2)- FOP < +00. 
ce Tl 


The integral is finite because f € H*(O). 


3.5.8 Trace theorem. Let f ¢ H*(R"), and consider R”-? CR”. Then 
the restriction operator 


pp: D(R") > D(R”-?) 
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has a continuous operator extension 
Ho 3 He-P/2 if s> . 


PRooF. Let y € D(R") and write z = y+ z, where y € R” ? and z€ R?. 
Then, by the inversion formula, 


oly+2)= f etetea eae 
R” 
Similarly, writing = + ¢, 
ey) =f eB + C)dnde. 
RP? xR"-P 
Letting p(y) denote the operator of restriction to R”~?, we obtain 


(i) oer =f ain+ ac. 


Moreover, 


llo(y) z \(oly))(m)2 + [pall2)°-?/2dy 

2 
fol An + 6a (1+ Ilnl2)*-?/2dn. 
R°-p |JRP 


By the Cauchy-Schwarz inequality, 


Ps 


ll 


2 
[20 +O). + In + ¢\l?)°/7(a + ||ln+ ¢lI2)~8/2ac 


<(f aoa) ( [lene ora+ In +c1P)*ac) 


The first integral on the right-hand side, say J(7), converges since s > p/2. 
Moreover, 


(1+ [ln + ¢ll?)* = (+ In? + lIcl?)° 


d¢ 
J — = . 
(n) Wi (1+ [[n|l? + Il¢ll?)s ae a este 


The first integral is bounded above by 


vol({IGIL< tl) 
(1 + ||n||?)s 


and 


< C(1 + |In|?)P?-° 
and the second by 


‘ 2\p/2—s 
Ta Wri2\5 1 Pp 
ies (1 + Ilcll?)* s+ lal 
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whence J(n) < e(1 + |In||?)?/?~* and 


No e)lle-ve Sef dn(a + In? /28(1 + InP)? 
«| flan + OPC + Ine clPyrac| 
Rp 
d ~ 2 2)s = ae 
< ef an] f iatn +o) + lin clPy'ac] = ell 
Thus 
(ii ler(Pllasv2 Selle. if 9 > &. 


The existence of the desired extension follows from the density of D(R”) 
in H°(R"”). 0 
3.5.9 Corollary (Serge Bernstein). Let s > n/2. Then H*(R") Cc 
Cy(R”), where Cy(R”) denotes the bounded continuous functions. 
PROOF. The inequality 3.5.8(ii), with p = n, gives 

lpn(v)(0)| < ellellas, Ve € D(R"). 


Since the H* norm is translation invariant, |pn(y)(x)| < clly|lH> for every 
xz € R”, whence, taking the sup over z, 


(7) len (Y)llo,R") S ellvllas- 


Let f € H*(R”). There exists a sequence yg € D(R”) such that || f—¢llas 
0. Then 
Pn(Pq) = Uq € Cr(R"). 


The uw, converge uniformly by (i); hence 


lim u(x) = u € C,(R").0 


3.5.10 Theorem. Let O be an open set in R” and let V be an (n — p)- 
dimensional submanifold of R” such that V C O. If s > &, then there 
exists a continuous restriction operator 


Ay,-(O) = He. (V), where s =s-—-. 


PROOF. H;,,.(V) is defined via an atlas of charts on V. This definition is indepen- 
dent of the choice of atlas, since passage from one atlas to another is accomplished 
by local diffeomorphisms.' The result follows from Theorem 3.5.7. 

Given vo € V, there is a local diffeomorphism from a neighborhood U of vo 
to O such that the image of VNU is the space R”-? C R”, and 3.5.9 can be 
applied. 0 


1See, for example, W. Boothby, An Introduction to Differentiable Manifolds 
and Riemannian Geometry (New York: McGraw-Hill, 1987). 
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4 Fourier Transform 
of Tempered Distributions 


Plancherel’s theorem, characterizing the image of L? under the Fourier 
transform, played a major role in the last section. Although we hardly 
considered the spaces WP (s integer, p # 2), the systematic use of the 
Plancherel isomorphism enabled us to study the spaces H* = W?. In Sec- 
tion 5, we will study pseudo-differential operators by restricting our atten- 
tion to their action on the classes H*, where we will again use the Plancherel 
isomorphism. 

In this section, we characterize the image under the Fourier transform of 
the space S(R”) of infinitely differentiable functions which, together with 
all their derivatives, are of rapid decrease. The Fourier transform is an 
isomorphism from S(R”) onto itself, and S(R") will be given a topology 
in which this isomorphism is continuous. The dual of S(R”) is the space of 
tempered distributions S’(R”) of Laurent Schwartz; the Fourier transform 
induces, by transposition, an isomorphism from S’(R”) onto itself. 

Our study of the Sobolev spaces of negative order will parallel that of 
S(R") and S'(R"). 


4.1 The space S(R") 
(i) Functions of rapid decrease 
Definition. A continuous function f on R” is said to be of rapid decrease 
if, for any integer m, 
(1+ |Ix||?)"F(x) +0 as [lal] — oo. 


The space of functions of rapid decrease is denoted by Co,o(R”) and 
equipped with the following sequence of norms: 


[Lf llm.o = max (1+ [lx|")"1F()]. 


Co,o(R") is thus a vector subspace of Co(R”), the space of continuous 
functions which vanish at infinity. Moreover, 
Coo(R”) = {f E Co(R”) 7 Il f llm.o < +00 Ym}. 

We define 
(Gi) S(R”) = {f € Coo(R”) : O°f € Coo(R"), Va=(ay---.dn); 
where 

alal f(x) 
(Ox!)h ... (Ox) In’ 
and this derivative is assumed to exist in the elementary sense. In other 


words, f is infinitely differentiable and all its derivatives are of rapid de- 
crease. 


Of = 
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(iii) Norms on S(R”) 


A countable family of norms is defined on S by 


Ff llm.r = sup |07 f llm.o- 


lqi<r 


These norms can be used to give S(R”) a metrizable topology, with 
distance defined by 


d(f, = ~(r¢m) [Fille _ 
FTE ne 

d(f.f) = df -f',0). 

(iv) D(R”) is a dense subset of S(R”) 


Let vy be an element of D(R”) such that y = 1 on a neighborhood of zero. 
Set Yn(z) = (2). If f € S(R”), then d(f, fn) + 0 and fy, € D. 


A linear functional / on S(R”) is continuous if and only if there exist m, 
r, and a constant c such that 


WCF) < ell filmer 


I| 


4.2 Isomorphism of S(R") under the Fourier transform 
Theorem (Laurent Schwartz). Let f € S(R"). Then 
(i) fe L! and the Fourier inversion theorem can be applied: 


fey= f Foe ar. 


(it) fe S(R") and there exist constants cy, such that 
lf lls < Crs |lF llem+ss where m>n. 


(itt) The mapping f — f defines a topological isomorphism of S(R”) 
onto S(R"). . 

(iv) (2* f)*(a) = ig FO) 

n ae 

(v) (see ty’ () = -ite Flt ; 

(vi) If f. g€ S(R"), then fg € S(R") and (fg) = fxg. 
(vii) If f, g€ S(R”), then fxg €S(R") and (f *g)* = fg. 
REMARKS. From now on, whenever there is no possibility of confusion, 
S(R") will be abbreviated by S. The Fourier transform on S has all the 
right properties: it maps differentiations to multiplications (by —7 times the 
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variable of differentiation) and vice versa, and convolutions to products and 
vice versa. 


ProoF. If f eS, 


J teveeiae f (srt) (net) ae 


This identity can be checked by an integration by parts on the right-hand 
side; the variation of the integrated term vanishes because f is of rapid 


of \* ae =e 
Bani (t) = —it, f(t), and (iv) is proved. More- 


(The last inequality uses the fact that (1 + eh Aiki ea 8 a 5 We) 
In general, it follows from repeated integrations by parts that (07) f(t) = 
(—i)!4lt9 f(t), whence |t?f(t)| < cllO%fllm.o, and finally 


decrease. It follows that ( 


Of 
Ox! 


since m>n. 


ini fo] < [54 


(viit) Fllr0 < Cmllfllmr- 
Hence f € S implies f € Coo C Li. 
Thus the Fourier inversion formula can be applied, and (i) is proved. 


Let 07 be a derivative of order q in t. It can be computed by differentiating 
the Fourier integral under the integral sign: 


af) = fle) floydr = fe "(—i)Ml0% f(w)ae. 
Since 27 f(x) € S, it follows from (viii) that 


7 fll < emlla? f(z) llr 


Writing out in detail the norm on the right-hand side gives 


lle? F(2) line = YO f(2)) Ilm.o- 


\l|<r 
By Leibnitz’s formula for the derivative of a product, 
O (at f) = 5 COr2t) (0 fy). 


It follows that ||z* f(x) ||mjr < Cmqllfllm4¢.r, Whence 


WF llr < Crsllfllm+sr- 


This proves (ii). 
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To prove (iii), we must show that the mapping f > a is surjective. Let 
. d 
h € S be given, and set hy(x) = [rer oe. Then 
(27)” 
(iz) hi(x) = Boh 


and h,; € S by (ii). We now compute its Fourier transform. 


[layer ae. 


A 


Ay(A) = fer nteide = oy [ieee ae = 


l 
(2)” 


By (i), hy = h. This shows that the Fourier transform is surjective. The 
inverse transform, given by (ix), is continuous by (ii). Both the isomorphism 
S — S and its inverse are continuous: it is thus a topological isomorphism. 

Applying the Fourier isomorphism to formula (iv), which has already 
been proved, gives (v). 

Since f, g € S C L', 1.6.2 can be applied and (f * g)* = fG. It is clear 
that the product of two functions in S is in S: if fa € S and g € S, then 
fg €S. It follows that f *g € S. This proves the first part of (vii), and the 
second part follows from (vi) by the Fourier isomorphism. 0 


4.8 The Fourier transform in spaces of distributions 
4.3.1 Notation 


Using the notation of Laurent Schwartz, we write S’ for the vector space of 
continuous linear functionals on S. S’ is called the vector space of tempered 
distributions on R”. For example, let 4 € M(R") be such that there exist 
i and C for which 


(2) lul({a : |lzll < R}) < C(lal]? +1). 
Then f f(x)du(x) converges Vf € S and defines a distribution in Ss’. 
4.3.2 Operations on S’ 


These are derived by transposition from continuous linear operations on S. 


(i) Differentiation is a continuous linear operation on S. Since 


25 
Or} 


< film ti, 


m,? 


differentiation on S’ can be defined by 


~(ghcts) = (If). west 


The left-hand side clearly defines a continuous form on S’. 
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(ii) Multiplication by a polynomial P of degree k is a continuous operation on S. 
Since 
P(x) f(x) |lmr Secllfllm+kr, where c= c(P), 


multiplication by a polynomial on S’ can be defined by 
(Pf.l) = (f, Pl). 


(iii) S is an algebra: the product of two functions in S is a function in S. That 
S’ is an S-module follows from the formula 


(hf, 0) = (f, hl), Vf ES, 


where / and h are fixed elements of S’ and S, respectively. 


4.3.3 The weak topology on S’ 


Definition. A sequence |, € S’ is said to converge weakly to lo if 
(f,ln) converges to (f,lo), VfES. 


Proposition. The operations defined in 4.3.2 are continuous in the weak topology 
on S’. 
In particular, if ln — lo weakly, then 


O i] 


a ge 


In other words, the differentiation operator is a continuous operator on S in the 
topology of weak convergence of sequences. 


PROOF. We prove this for differentiation: 


(Ff get'n) = Caer’): 


Since 2h eS if f € S, the right-hand side converges to (~5F 0), Oo 
4.3.4 Theorem (Laurent Schwartz). Let a mapping Fg: : S’ > S' be 
defined by setting 


(f, Fgil) = (fil). 


Then F g: is an isomorphism from S' onto S’, mapping weakly convergent 
sequences to weakly convergent sequences. 

Moreover, F gs: can be restricted to L and L? by means of the inclusions 
L' cS’, L? CS’. The restriction of Fg: to L' gives the Fourier integral; 
the restriction of Fg: to L? gives the Fourier-Plancherel transform. 

Finally, the inverse of Fg is given by 


Fg (u) =Fg(t), Wes’. 


REMARK. If p1 is a positive measure satisfying 4.3.1(i), F.g/() is defined 
even though the integral f(t) might diverge for every t. 
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PROOF. Fixing | € S’ and setting 
vf) = (F0, 


we obtain a linear functional on S which, as the composition of continuous map- 
pings, is itself continuous. Hence there exists 1; € S’ such that y(f) = (f,h). 
Let 

b= Fogi(l). 


Since f > fis an isomorphism of S onto S, its transpose Fg: is an isomor- 
phism from S’ onto S’. Moreover, by Parseval’s relation (cf. 2.6), 


~ 


(fu)=(f.u), VfEes, Wel. 
Hence Fg is an extension of the Fourier integral on L*. The same result holds 
on L?. 

Finally, the inversion formula for F ¢’ is proved by transposing the inversion 
formula on S. 


4.3.5 Support of a distribution 


Let 1 € S’. We say that | is zero on the open set O if I(y) = 0 for any 
y € S(R”) such that supp (y) C O. Differentiable partitions of unity can 
be used to show that there exists a largest open set Q on which 1 is zero. 
The complement of Q is called the support of l. 

4.3.6 Sobolev scales of distributions 


For a fixed positive real number s, let D(R”) be given the H~* norm 
defined by 


livlla-: = sup ff pfdz, where feH’, |lfilx> <1. 


Since D is dense in H®, ||y||7-s = 0 implies that y = 0. 
Using the notation of Sobolev, we let H~*(R”) denote the completion 
of the space D with respect to the H~* norm. 


Theorem (Sobolev). The Fourier transform extends from D to H~* and 
realizes an isometric isomorphism from H—* onto L?(R”, 1s), where duis = 
(1+ |l¢||?)~Sdt. 


PROOF. If f € H®, then f € L? and the Fourier-Plancherel isomorphism gives 
[ etae= [owner 
Hence 


kell» = sup : p(t)u(t), with | ju(t)2(1 + [ltll?)Sat <1. 
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By the Cauchy-Schwarz inequality, 


| [awa = | feo + ey + te) o(04e 
jay?) 
= hears 


whence 


1/2 
lella-s < [ioirestod : 


Equality occurs when v(t) = cp(t)(1+4 ||t||?)~*, with the constant c determined 
so that |luj|j~> = 1.0 


4.3.7 Comparison of the two theories 
(i) Proposition. For every s > 0, H~°(R”) c S'(R”). 
PROOF. S(R”) C H°(R”). Moreover, 


: n 
Ifllmr > Wfllne if Bs, m> ZF. 


Let 6 € H~*. Then @ defines a linear functional on H* and 
lO(f)| <ellfllas <cllfilmr Vf € H". 


Hence @ is continuous on H® if H® is given the topology induced by that of S. 
Restricting 9 to S gives a continuous linear functional 6; on S and 6 +> 6; defines 
the desired map H~* — S’. 

This map is injective: D is dense in H*; a fortiori, so is S; thus a linear func- 
tional on H* that vanishes on S is identically zero. 0 


(ii) Proposition. Let 1 ¢ S' and suppose that | has compact support. Then 
there exists p such that 1 € H~?(R”). 


Proor. There exists a pair of integers m, r such that 
UA) <ellfllmr Vf €S(R"). 


Let y € D(R”) such that y = 1 on the support of /. Then I(fy) = I(f), 
whence |l(f)| < ellyf|lm.r. But 


ef llmr S clellmrllfilwe- 


Moreover, by the corollary to the trace theorem, ||f||z- < el|fllas if 
s> § and ||fllwx < cllfllazs+-. Hence 


IL(F)| < cll fllastr- 


Thus | extends to a continuous linear functional on H°*+", whence | € 
AOSs-? 
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5 Pseudo-differential Operators 


The Fourier transform on R” diagonalizes linear differential operators with 
constant coefficients. This property leads to representation theorems for the 
solution of the homogeneous equation as a limit of sums of complex expo- 
nentials, as well as existence theorems for the nonhomogeneous equation. 
These theorems, due to Leon Ehrenpreis and Bernard Malgrange, use the 
Fourier transform in C” as a fundamental tool. 

Complex-analytic methods are needed to prove these theorems, which 
are naturally formulated in the context of Laurent Schwartz’s theory of 
distributions. 

To obtain such general results, we would need not only to study locally 
convex topologies on spaces of distributions and duality between locally 
convex spaces, but also to prove minimum modulus theorems for holomor- 
phic functions of several complex variables. All these methods originate in 
different currents of thought from those we have followed up to now. 

We will study differentiable operators with variable rather than constant 
coefficients, and on bounded open subsets of R” rather than on all of R”. In 
physics, differentiable operators with variable coefficients invariably appear 
when an inhomogeneous medium is considered. 

At first glance, Fourier analysis seems to have no means of obtaining re- 
sults in this setting. It was thus a striking result when Alberto Calderon, in 
1957, introduced an “infinitesimal Fourier transform on the tangent space” , 
which assigns a “symbol” to an operator and thereby embeds differential 
operators in the wider class of pseudo-differential operators. In this class, 
one introduces an infinitesimal symbolic calculus which consists of multiply- 
ing symbols. Calderon’s symbolic calculus theorem states that the symbolic 
calculus corresponds to the composition of operators modulo regularizing 
operators, i.e. with the gain of one derivative. 

The pseudo-inverse of a differential operator can be explicitly constructed 
in integral form. 

This section ends with an application of the pseudo-inverse, in the proof 
of the elliptic regularity theorem. 

Pseudo-differential operators are a basic tool of the theory of partial 
differential equations. The spectral pseudo-decomposition they effect, and 
the integral estimates they entail, make up, to some degree, the extension 
of Sections 1 to 4 of this chapter. 


5.1 Symbol of a differential operator 
5.1.0 Notation 


In order to distinguish clearly between the variables x € R” and t € R” of 
the function f(x) and its Fourier transform f(t), we set R” = E, where E 
is an n-dimensional vector space over R, and write its dual as E’. The dual 
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pairing is denoted by 
(x,&) or v.€, wherexe E, €€ E. 


For a fixed choice of volume measure on E£, the Fourier transform is written 
f= | f(xje*Sdz, where f €L'(E£). 
E 
The volume measure dé on E£ is fixed so that, on L'(E) 9 L?(B), 


[ue@Pae = [feras 


Similarly, if y € A(£), the Fourier inversion formula is written 


(i) y(a) = [e@e*ae. 


The two measures dz and d€ are called associated. The Fourier-Plancherel 
transform is an isometry of L?(E) onto L?(E). We observe the convention 
of choosing a basis for F in such a way that the associated volume element 
is equal to 1. Under these conditions, we are led to define two bases ex of 
E and e* of FE as Fourier-dual if 


(e;,e*) = Wm if k=), 
(e;,e*) O° fs heey. 


Let E(E) be the vector space of infinitely differentiable functions on E, 
and let D(£) be the subspace of €(F) consisting of functions with compact 
support. We will consider differential operators of the form 


(ii) La ano" 
|m|<s 
where m = (m,...,™M,,) denotes a multi-index, that is a system of n non- 


negative integers. Let |m| = mi+...+mMn, let 0; = 0/Oz!,...,0, = 0/Ox", 
and let 0™ = 0)" ...0". The coefficients a(x) will be “sufficiently dif- 
ferentiable” functions of x. If L is not the zero operator, the largest |m| 
such that am # 0 is called its order. 

Given y € €(E), we define 


(Ly)(t) = SY) am(x)(O"y)(z). 


|m|<s 


If am € E(£), then L defines a linear operator from €(F) to €(E). The 
symbol of the operator L is the function defined on E x E by 


(iit) or(2,€) =e**Lipe(z), where ge(x) =e. 
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Writing this out in a basis, 


(iv) o1(@.€) = S~ dm(x)(—i€1)"™ «.. (iE). 


The symbol is thus a polynomial in € for every fixed x. The advantage 
of (iii) is that it is independent of the choice of basis, while (iv) appears to 
depend on the choice of basis. 

A differential operator can evidently be reconstructed from its symbol; 
it suffices to write the symbol, in a basis, as a polynomial in €, and to 
substitute 70, for & in the monomials. This elementary calculation can be 
replaced by an integral expression, which has the immense advantage of 
being applicable to functions o(x,€) more general than polynomials in €. 


5.1.1 Theorem. Let L be a differential operator on E with symbol oy (2, &). 
Then 


(i) (Lyp)(x) = fpovle eee "at. vy € D(E), 


where ((§) = fi y(x)e'"Sdxr denotes the Fourier transform of y. 


ProorF. By 4.1(iv), D(E) ia S(E) = S(E), whence @ is of rapid decrease. 
Thus o;,(a,.£€)(&) is of rapid decrease and the integral in (i) is convergent. 
Moreover, by differentiating the inversion formula 


te) =f aleen**ag 
E 
with respect to 0, we obtain 
(rele) = f GE(—ies)e*™ Ue, 
and more generally 
(Of... ay)tc) = f (-iery™ ... -igy™BLeem* Fae. 


The theorem follows by multiplying both sides by a(x), pulling am(x) 
through the integral sign, and summing over m. 0 


5.2 Definition of a pseudo-differential operator on D(E) 
5.2.1 The class of symbols C(3,r, 0) 


Let 3 be a real number and let r be a positive integer. We define a class 
C(G,r,0) of measurable functions g on E x E which satisfy the following 
two conditions. 


5 Pseudo-differential Operators 159 


(i) q has compact support in x; that is, there exists a compact subset K of 
FE such that 


qg(z,€)=0 if w¢ékK, forany €¢€ EB. 
Derivatives with respect to x in E x E are denoted by 0"”. The functions 
q are required to satisfy the following regularity condition. 


(ii) |](1 + |lE|) 20" ¢(a, 3) ret < +oo for every multi-index n such that 
In| <r. 


5.2.2 EXAMPLE. Let L be the differential operator of order s considered in 
5.1.0(ii). If the coefficients of DL are in W°°, then 


or(a, £) ‘= C(-s, Tr, 0). 


It is clear from this example that, in the class C(3,r,0), the integer r 
corresponds to the regularity of the coefficients and the number —/ to the 
order of the operator. 


5.2.3 Pseudo-differential operators defined on D(F) 


With a given symbol g € C(G,7r,0) and function » € D(E), we associate 
the function 


(i) (Agy)(2) = [pate8eee at 


The integral converges since, for fixed x, ¢ is of rapid decrease in € and 
g is of polynomial growth in €. Differentiating under the integral sign with 
respect to x shows that A,y € W,°°, and it follows from 5.2.1 that A,y has 
compact support. All these observations are trivial; the following theorem 
is not. 


5.8 Extension of pseudo-differential operators 
to Sobolev spaces 


5.3.0 Theorem. Let g € C(8,r +1,0) and let n = dim(E). Assume that 
s > -—B8 satisfies 


(2) O0<s<r—-n. 
Then there exists a constant c, such that 
(i) |Aggvllas <csllellas+e, Ve € D(E). 


(iti) There exists a unique extension of Ag to a bounded operator Ag from 
H*(E) to H8+4(E). 
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PROOF. Statement (iii) follows from the density of D(£) in H*(F) and 
from inequality (ii). 

Since the H* norms can be computed in terms of the Fourier transform, 
(ii) can be expressed as an inequality between Fourier transforms. Since 
A,y is a bounded function with compact support, its Fourier transform 
can be computed. This computation leads to the following lemma. 


5.3.1 Lemma. 


(i Ayoln) = fi Kalen s)R(@de, 
where 
(i) Kylne) =f gla.ge"Oade, 
PROOF. 
(Age)(n) = I cl" Agey(u)de = fede [ate eleeras, 


The double integral [ pup l9(@ §)P(§)|d&dx converges: it is bounded above 
by 


NC + NEI? 9@. 8) = cox B) meas (A) Z PCE) + Nel) Pa, 


where A’ denotes the support in x of the symbol, and the integral on E 
converges because ¢ is of rapid decrease. Hence Fubini’s theorem can be 
applied to reverse the order of integration: 


Avon) = f aeas f g(a get de. 
E E 
Fubini’s theorem guarantees that the integral on E' converges for almost 


every €. Since g has compact support in 2, it actually converges for every 
€, and there exists a constant c such that 


(wit) [Kg (7.€)| < c(1 + ||él|)~ meas (’). 


5.3.2 Estimating the kernel kK, 


Lemma. Suppose that g € C(3,r + 1,0) and let r’ be the integer defined 
by r < 2r’ <r+1. Then 


(7) |Ko(,€)| <1 + IEII)~P + We = al?) 


Proor. Let {x*} be an orthonormal basis with respect to the metric |{z||. 
In terms of this basis, the Laplace operator on F is defined by 


A; = > (x wZ 
k=1 
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Then 
A,e** = —lé||?etS. 


Let At be the differential operator on E with constant coefficients de- 
fined by ; noe 
Ay =(1-Az)) =1-r/A+---+(-1) A’. 


Then A” e!®--8) = (1 + ||n — €l]?)" e@-(-©),, whence 
Kalney(a-+Iin—€Py" = ff ale.) aze™-P de. 


Since g(z,€) has compact support, we can integrate by parts and turn 
derivatives of the exponential into derivatives on g. Thus 


(1+ lin €IPY"Ky(n.8) = f (Az (a) Pde, 
and (i) follows by 5.2.1(ii). 


5.3.3 Proof of the extension theorem 


(i) Lemma. Let f € L?(E) and let F(f) denote the Fourier transform of 
f. Then f € H® if and only if F(f)(€) = (1+ llEll) ~8k(€) with k € L?(B). 


PrRooF. Cf. 3.5.1. 0 


(ii) Lemma. Let Ko(n,€) = K(n.€) (44 )" + fely-?. Let 


Gof) = f Kolne ys ea. 
Then 5.3.0(ii) is equivalent to the inequality 
IGoflleagy <lfll gags VE € L°(B). 


PRooF. By 5.3.3(i) and 5.3.1(ii). 0 


(iii) Lemma. 
|Kg(n,€)| < c(1 + |l€ - nl2)7" +3 
PROOF. This follows from the inequality 


G + Ilnll 
1+ léll 


which is proved by considering the following two cases: 


) svat in~ el) 


(a) ||7|| < 2||€|]. Observe that the left-hand side is less than or equal to 2°. 
(b) |lnl| > 211g ||. Observe that 1+ |||] < 1+ 2€ — nl]. O 
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(iv) CONCLUSION. To prove 5.3.0(ii), note that |G, f| is bounded above by 
replacing K, with an upper bound for |K,|. Using 5.3.3(iii), it must be 


shown that 
lf (&)| 
/ G+ le -alyroe™ 


The left-hand side can be written formally as |f| * u, where u(€) = (1+ 
Weyer 

Next, -2r/+ 5 <s—r < ~—n by 5.3.0(i), whence u € L?. Finally, using 
1.8.2, 


< ellfllz2cz)- 
L?(E) 


| flee Ice S [ello (flln2- 


5.4. Calderon’s symbolic pseudo-calculus 
5.4.0 Motivation 


The Fourier transform maps a differential operator L° with constant coeffi- 
cients to multiplication by the symbol o;0(&). (The hypothesis of constant 
coefficients is reflected in the fact that the symbol no longer depends on z.) 
Thus the composition of constant-coefficient differential operators L° and 
M° — that is, the differential operator Q° = L°M° — corresponds to the 
product of symbols ¢go = apoyo. The differential operators with constant 
coefficients form a commutative algebra for which the Fourier transform 
makes possible, to some extent, a spectral theory. 

The differential operators with variable, but infinitely differentiable, co- 
efficients also form an algebra: two such operators can be composed. But 
this algebra is no longer commutative. 

For example, consider the differential operators L = x! sor and M = st 
on R. Then 


ous O es O 
= 1 _— =] nl _— A — IV SSS 
IM=2x (<7) . ML 5zl +2 (=) , and LM-ML Bai" 


Commutativity has been lost. Nevertheless, the commutator LM — ML 
is an operator of order 1, while the product is an operator of order 2. 
One might say that commutativity is preserved, modulo operators of lower 
order. 


5.4.1 Introduction to the classes C(,r, 1) 


A subclass C(3,r,1) of the symbols C(3,r,0) is defined by imposing the 
following additional axiom: 


(i) CAF NEI? (O24), Olly aocgex Bh < FO, Yen such that fm] <r, 


Similarly, a class C(3,r, 8) could be defined by differentiating s times with 
respect to € instead of once, and replacing @ by 6+ s. These classes would 


5 Pseudo-differential Operators 163 


appear in computing multiple commutators; such computations would arise 
from taking limits that we have held fixed. 


Pseudo-products 


Let p and q be the symbols of the pseudo-differential operators Ap and 

Aq. The pseudo-product of Ap, and A, is the operator whose symbol is the 

product of the symbols. This operator is written A,O.A, and, by definition, 
Agp = Aq Ap. 

With the formula for the derivative of a product, it is easy to verify that 

(ii) if g € C(B,r,0) and h € C(f’,r,0), then gh € C(B + B',r,0). 


The pseudo-product is a commutative operation and therefore cannot cor- 
respond to the composition of operators. However, it does give an approx- 
imation. 


5.4.2 Calderon’s commutation theorem. Let p € C(3,2r + 2,0) and 
letq € C(8’,r +1,1). Suppose that r > 6’ +1. Set 


R= AgAp — AqUAp. 
Then, for s such thatO <s<n-—r, 
(i) R: H°(E) > H°t8+8'+1 (BF) 
and there exists a constant cs such that 


(22) WRF llystetess < Csllfllae- 


PROOF. Since D(E) is dense in H°, it suffices to prove (ii) when f € D(E). 
As in the extension theorem, we take the Fourier transform of both sides 
of the inequality. For f €¢ D(EF), let 


A, fn) = Apt = [Koln €) flOae. 


The kernel K, was computed in 5.3.1. 
The proof of this theorem will require several lemmas. 


5.4.3 Lemma. 
(i) (Ay(ApN)) = f Gon €)FEaE. where 
(i) Gal) = fodu ff ple + heal. mere dh de 


PROOF. Composing the kernels gives 


G8= [.KaQmKyln.8)dn. 
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Replacing AK, and K, by the expressions given in Lemma 5.3.1, 
Gio fu | p(x, E)q(z, ne@™E-MF20-) dr dz, 
E E2 
Setting « =z+h and z=z in E?, andy =€-pin EB, we obtain 


Gap = [uf p(z +h, €)q(z,€ — pet t-N de dh.O 
E E2 


5.4.4 Lemma. Let O be a compact subset of E containing the supports of 
p(z,.) and q(x,.). Then there exists an even function u € D(E) such that 
u(x, — 22) =] of X41, r €O, 


(i) yp) = [ow / i le + h,8)a(z,€~ we" Mu(hyeltdh de, 


and 


(ii) = ie du | i u(herran) 


PRooF. Let 

O,={ye B:y=2-22, x, € O}. 
Then O; is a compact subset of E containing the origin. There exists a 
function u € D(E) equal to 1 on Oy. 

The right-hand side of formula 5.4.3(ii) is nonzero if z+h € O and z € O; 
that is, if hk € O,. Multiplication by u(h) is multiplication by 1; this proves 
formula (i). 

The second formula is obtained by applying the Fourier inversion formula 
to u € D(£) and noting that, since the origin is in O,, u(0) = 1.0 
REMARK. We must be careful not to write a double integral in (ii), since Fubini’s 
theorem does not apply. Similarly, 5.4.3(ii) cannot be written as a triple integral. 


5.4.5 Lemma. (Gy p — Kgp)(A,€) = L(A, €) + J(A,), where 
() He = ff apple e)laleg - u) ~ a2,8)) eo dea 


(it) 10.8) = fidu f aeg—we"eMae [faa and 


| if ah] = [ole +h8) ~ ve. semrulnyah. 


PROOF. Formulas (i) and (ii) of Lemma 5.4.4 and 5.4.3(i) imply that 


Gog Kooy S Seger dedp fe'"| Ju(h)dh, where 

plz +h, €)a(z,€ — uw) — p(z,€)a(z,€) =[ ], +[ Jy with 
plz; E)(a(z, & oe Ht) — q(z, &)) and 

a(z,€ — w)(p(z + h, €) — plz, €)). 


Lo dl 
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Note that the first term no longer contains h; hence the integration in h affects 
only e'“"u(h), which, since u is even, gives u(j). Thus we have 


1.62 [ou / RUAL jeeeptze= a Vae 
E E 


Since u € L', Fubini can now be applied to obtain (i). Integrating the expression 
[ ], and applying Fubini to the integral [ ‘tes yield (ii). O 


5.4.6 Lemma. Set 
g(2z,€) = [aes — p) — 9(z,€))u(p)dp, 
EB 


and let 
I(z, €) = Plz, €)g(z, €). 
Then I = Kj. 


PRooF. Integrate 5.4.3(i) with respect to yz, then use Lemma 5.3.1. O 


5.4.7 Estimating the integral I 


We use the extension theorem 5.3.0 to show that 
(i) gec(p + 1,74 1,0). 


5.4.1 (ii) will then imply that pg € C(8 + 6’ +1,r4+1,0). 
We first use Taylor’s formula with integral remainder on EF to obtain 


(2,8) =f Om ak(2, €, 14) @(n)dp, where 


aK (2, g p)=— i; (0g, q(2, € _ ty))dt. 


Differentiating with respect to z gives 
a8 (a6) =D f (oP axle. 8.0) )A(uana 
k 
whence, by 5.4.1(i), 


1 
(ii) A™(g(z.€))| <e I | (1+ |€ = tull)-9 Ifa) Budde. 


Let v(i) = ||] |@(e)|. Then v is of rapid decrease. Set 


Fe) = f+ eta) onde = f+ f 
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where A = {: |[te|| > Hey, For any integer m, 


: —m 
| S ih Lav <p, (1 e i) : 
A t 


[scat he- les 


Hence, taking m > 3’ + 1, 


Moreover, 


(iii) F(t,€) < Cy(1+ fel? 7, 


where the constant C) is independent of ¢. Integrating with respect to t 
gives (i). 


5.4.8 Estimating the integral J 


We now use Taylor’s formula with integral remainder on E. Set 


1 
Coe | (Op) th, Hae. 


i) 


Then p(z + h,€) — p(z.€) = > h* pp. Writing ih, = Oune’” gives 


= (plz +h. €) — plz, €))e""u(h)dh = | S- pa (Ounce )u(h)dh. 
E E 


Since jz appears only in the exponential terms and we can differentiate under 
the integral sign, 


[iol + he) ~ plage" ulurdu = 7 Bp. val2. 158) 
EB 
where 
(2) SAE Be Ss if pr (z,h, Ee” ulh)dh. 
E 


Since y, and wu are sufficiently differentiable in h and u has compact support, it 
follows that 7, which can be regarded as a Fourier transform in h, vanishes at 
infinity together with its first derivative. Substituting into 5.4.5(ii) and reversing 
the order of integration, we can thus integrate by parts on E with respect to Ur, 
and we obtain 


I(x, =>0 | eed [onalas ~ pu 2, ms ee 
k EB E 


Let 
gr (2, €) = [enna — p)We(z, pw, E)dp. 
E 
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Then J = )>, Ky,- We now show that 
(it) ge €C(B + B' +1,7+1,0) 
by finding an upper bound for 
Unlee) = Sy [LOM an aya ~ war vale ede 
m1,™m2 a 


with |mi| + |ma| <r +1. Since g € C(8',r, 1), 
(ai) (UC) eS [LO ea area see 
m2 EB 


(iv) Lemma. There exists a constant c, independent of €, such that 


Cc 


B em oe Tn 
(1 + {lél)) | z Wrz, ws, €)| < (1+ |lull2)"”’ 


where r’ is the integer such that r < 2r’ <r+1. 


PROOF. Using (i), 
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/ ((—An + 1)" [u(h) OR? (pe(z, hb, €)) et dh = (1+ |Iqll?)” OF? de (z, 1, h).- 
E 


The inequality follows, with 


c = meas (K)|(—An +1)" [u(h)OP* pe(2 Hs AYll poo cayeBy" 


Here K is the support of p in z. 


The following lemma, 5.4.9, together with (iii), (iv), and the hypothesis that 


r > 6 +1, imply that 


[Ux (2,€)| < c(1 + lll) P+ WEI? 
That is, (ii) is proved, and with it the commutation theorem 5.4.2. 0 


5.4.9 Lemma. Let r be a positive number and let 
hy(n) =(1+|Inll)7", where ne E, dim E=n. 
Then, ifr >n ands >0, 
hp *hs <c(r,s)hi, where t= inf(r,s). 


PROOF. Let 


1 1 
Orhan =f ora psaer?= ft fh 
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where A(7) = {A: 5llnll < |IAll < 2lln||}. Then 


1 c 
<max ye Ac Doe | hells S ——— and 
/ a ee (in all + r) “SOF Tm 


1 
SMaX ye a(n) BT lla Ta(Rs)Ilets 
/ A(n) Meee Ese Ayre eas 


where 7 denotes the translation operator. 
(i) Ifs > n, then ||La(m)T(Rs)|1r1 < |[Rsl]z1 < +00. Hence 


1 
< ¢c——_—_— 
Le ~ (1+ AID" 


hy *hs <c(hy ths) < che. 
(il) Is <m, then |ILacnt(Aadllar S Synycajey bs = e+ II)" * and 


and 


hp *hs < che + C1 + IIE)" O7" < clhs + Rstr—n)- 


The conclusion follows by noting that s+r—n > s, whence h, *hs < chs. Since 
s<n<r, the lemma is proved. 


5.5 Elliptic regularity 
5.5.0 Definition. Let L be a differential operator defined on an open subset 
O of R”: 
L= SS Gy_(x)O™. 
Im|<d 


Let o,(x,€) be its symbol. L is said to be an elliptic operator if, for every 
compact subset K of O, there exist two constants c,, co depending on K 
such that 


(7) lox(x,€)| 2 call€ll if 2 € K and if |[§|| > co. 


(ii) EXAMPLE. Consider the Cauchy-Riemann operator on R?, 


6) 0 
Lo = ml + ina (where i = V-1). 


Then 
OLy (€) = —1&y ae &2 and loz, (€)| =, Ell. 


(iii) EXAMPLE. On R", consider the operator 
= —S oak) (x)d,0; + C*(x)O,« + q(x), 
where the matrix a;; is symmetric and positive definite. Then 


o1,(@,£) = > a* (x)Ex€s — 1C*(x)& + g(z). 
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For ||€|| sufficiently large, the quadratic form dominates the first-order 
terms and JL is elliptic. 


5.5.1 Theorem. Let L be an elliptic operator of order d defined on the 
open set O. Suppose that the coefficients of L are functions in WyP19, 
wherer >d+1,r>n. Let f € H#.(O); then Lf is well defined and 
Lf € L?,.(O). Under these hypotheses, the following two statements are 


; loc 
equivalent: 


(i) Lf € Hj,,(O), where 0<s<n-—r. 
fe He (0), where 0<s<n-r.(ti) 
PROOF. It is trivial that (ii) > (i). 

In order to prove that (i) > (ii), we construct a local pseudo-inverse of L. 
Here pseudo-inverse means an inverse in the sense of Calderon’s symbolic 
pseudo-calculus, and local means on a compact subset of O. Let O; be an 
open set such that O,; C O. Let y and wy be elements of D(OQ) such that 
y = 1onO, and y = 1 on the support of yp. Let L; = pL, u = Lf, ui = yu, 
and f; = Wf. Then u, € H°(E), fi € H2(E), and yL(wf) = yL(f) since 
w = 1 n the support of y. Hence 


(i) Ihfi = Ui- 


Let o1,(x,€) be the symbol of L;. Then o;, has compact support in x 


na 


(since its support is contained in the support of y). Let 6 € D(E) be equal 
to 1 if ral < c2(O1). Set 


g(x, €) = v(a)(1 — 0(€))[ox(2, 8)". 
Then it follows from 5.5.0(i) that g € C(d,r, 1). 
Moreover, let goz, = p, where p(z,€) = y?(x)(1—0(€)). Multiplying the 
two sides of (i) by Ag gives 
Aglif; = Agui =v, where vé Het. 
Set 6(x) = 6(—x). Then 
(Apfi)(2) = *(@)[fi(@) ~ (0 * fi)(a)). 
By the commutation theorem (5.4.2), 
Agl1 = Ap+R, where R:H* > H* +, 
Since y?(x)(0 * fi)(x) € D(E), it follows that 


(it) vht+Rf=W, with We H°+4(E). 
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Rf; € H*! since f; € H4%; thus y?f,; € H¢t! if s > 1. As this is true for 
every y € D(O), 
fe Hit(0), or fee Ht(0). 
This last relation holds for every y; hence f € H/t!(O), and we have 
gained a degree of differentiability. Working backwards, we conclude that 
fi © H4! and therefore Rf € H4+*?. 
Substituting into (ii) gives 


g?f=W-Rf, with WeH*t? and Rf ce Ht, 


Hence, if s > 2, f € H4+?(O,); as this is true for all O,, we conclude that 
fea, (0). 

Substituting again into (ii), we find that yf, € H4+°(E) for s > 3, 
and hence that f € H4*8(Q,). As this is true for all O,, it follows that 


fe He), Hence ia 
(bf) H™°(E) and R(bf) ¢ H7**(E). 
Substituting a third time into (ii) gives, as before, 
efeH (EF) if s>A4. 


We continue this procedure until forced to stop, when d+ 7 > s+d. The 

last possible step gives 
gfe HE), whence f € Hft4(O).0 

REMARK. With appropriate hypotheses on the differentiability of symbols, 
it is possible to let pseudo-differential operators act on Sobolev spaces of 
negative order and obtain the following improvement of the elliptic regular- 
ity theorem (5.5.1). Let L be an elliptic operator of order d with infinitely 
differentiable coefficients, and let s be a real number. Then Lf € Hj, im- 
plies that f ¢ HSt@. 


loc 


IV 


Hilbert Space Methods 
and Limit Theorems 
in Probability ‘Theory 


1 Foundations of Probability Theory 


1.1 Introductory remarks on the mathematical representation 
of a physical system 


Before we introduce the notion of probability, it seems advisable to describe 
the type of mathematical model used to represent a physical system. 
Representations can be given from two distinct points of view: 


e the point of view of essences, or 
e the point of view of phenomena. 


The point of view of essences, generally that of the pure mathematician, 
consists of thinking that the physical system can be perfectly known. The 
space of all possible states is introduced, and a state is a point in the space 
of states. This point of view is, for instance, that of rational mechanics: the 
state of a system of n physical points is completely determined by a point 
in R®” (position and velocity of each of the particles). 

The point of view of phenomena, generally that of the experimental 
physicist, consists of observing a few facts which occur in a physical system 
so complex that the physicist, at the outset, concedes that he will never 
understand its basic structure. For example, the physicist can use thermo- 
dynamics to analyze the phenomena of a gas without having to determine 
the state of all its molecules. 
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The mathematical model corresponding to a phenomenological represen- 
tation is based on a logical calculus. The physicist introduces the set B of 
all events that he will be in a position to observe in studying the physical 
system. B is given the structure of the logical calculus, in which 


e A, + Ag denotes the occurrence of the event A; or the event Ag; 


e A,.A2 denotes the occurrence of both the event A; and the event Ag; 
and 


e () denotes the impossible event and 1 the sure event. 


The set B of all events thus forms an abstract Boolean algebra. (See I-1 
for the definition of Boolean algebras of sets.) 

The phenomenological point of view, initially of more modest scope than 
the point of view of essences, is much more adaptable to describing gains 
in knowledge. Indeed, a physical system described twenty years ago by a 
Boolean algebra Bo of events can be described today, after a more detailed 
analysis, by a Boolean algebra B,. All the events that appeared twenty 
years ago in By will appear in B,. Thus there is an injective mapping 


Bo a Bi, 


which commutes with the operations of the logical calculus and permits 
Bo to be identified with a subalgebra of 6,. Progress in understanding the 
system is described by a sequence of Boolean algebras, 


Bo > By > By > Bs DP seas 


where the arrows are injective homomorphisms of Boolean algebras. This 
sequence will give progressively more detailed representations of the phys- 
ical system, although it may never arrive at a final representation that 
would correspond to complete understanding, beyond the reach of the ex- 
perimenter. 


1.2 Axiomatic definition of abstract Boolean algebras 


A Boolean algebra is a set B together with two commutative and associative 


operations, written 
AUA’ and ANA’. 


Each of the two operations is assumed to be distributive with respect to 
the other; that is, 


AU(BNC) = (AUB)N(AUC) and 


I 


AN(BUC) (AN B)U(ANC). 
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We assume further that there exist two elements @ and 1 in B such that 
AU®=A, AND=0, AU1=1, and AN1=A, 
and that there exists a mapping A — A‘ of B into B such that 
AUAS=1, ANAS=6, and (A‘)°=A. 


Using the commutativity and associativity of U and M and the distributivity 
of each of these relations with respect to the other, it is easy to verify that 


(AU B)® = ASN BS, (AN B)® = ASUBS. 
Finally, 1° = @ and 0° = 1. 
Associated order relation 


Given a Boolean algebra B and A,B € B, we say that A implies B, and 
write A< B,ifANB=A. 

It is easily verified that < is an order relation on B. With respect to this 
ordering, 1 is the largest element and @ the smallest element; that is, for 
any A€B,A<AK<1. 

Using the commutativity of U and M, we note that AUB and ANB are, 
respectively, an upper and a lower bound of A and B. In fact, AU B is the 
least upper bound of A and B and AM B is the greatest lower bound of A 
and B. Let us show this, for example, for AU B. Let C be an element of B 
such that A < C and B < C; then, by definition of the order relation, 


(AUB)NC =(ANC)U(BNC)=AUB, and AUB<C. 


1.8 Representation of a Boolean algebra 


How to pass from the point of view of essences to that of phenomena is 
clear. 

If Q is the space of states of the physical system being studied, we asso- 
ciate with an event A of this system the following subset of 2: 


A’ = {w € 2: the event A is satisfied by w}. 


The operations of the logical calculus correspond to taking unions and 
intersections in the set P(Q) of subsets of 2. With these two operations, 
P(Q) is a Boolean algebra. The following statement summarizes our obser- 
vations. 


1.3.1 The data of a phenomenological representation of a physical system 
of which the space of states 9 is known are equivalent to the data of a 
Boolean subalgebra of P(Q). 
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The converse, that every abstract Boolean algebra can be represented as 
a subalgebra of P(Q), is proved in the following fundamental theorem. 


1.3.2 Stone’s theorem. Let B be an abstract Boolean algebra. Then there 
exist a compact space 2 and a representation identifying B with a Boolean 
subalgebra of P(Q) of subsets that are both open and closed in Q. 


PROOF. The proof of Stone’s theorem is clear when card(B) < oo. In this 
case, we define atomic events as those that are minimal in B with respect 
to the relation <; then Q is the set of atomic events. 

In the general case, we introduce the notion of a filter on B. A filter F is a 
nonempty subset of B such that 


Ai, Ao © F implies Ai Ag € F; 
A, © F, A; < Ao implies Ao € F; 
and 
0 ¢ F. 
The inclusion relation on the set of subsets of B defines an order relation on 
the set of filters: 
Fi, 2>F2 if A€F2 implies A€ Fi. (Fy, is then called finer than F2). 


An ultrafilter is a filter U of B such that F = U for every filter F such that 
F >U. Zorn’s lemma shows that, given a filter Fo, there always exists an ultra- 
filter U/ finer than Fo.! 


1.3.3 Lemma. Let F be a filter on B and let Ay € B. Suppose that AN Av 4 9 
for any A € F. Set 


Fay ={Z€B:Z contains a set of the form AN Ao with A € F}. 


Then Fa, is a filter. 
PROOF. Clear. 


1.3.4 Lemma. A necessary and sufficient condition that a filter U be an 
ultrafilter on B is that, for any Ag € B, either Ag €U or Af EU. 


PROOF. Suppose that U/ is an ultrafilter. If Aj ¢ U, then it is impossible that 
AN Ao # 9 for every A € U. Otherwise 1.3.3 would imply that U/ 4, is an ultrafilter, 
necessarily finer than U since Ao € Uy; but this is a contradiction. Hence, if 
Ao €U and Ag ¢ U, there must exist X, Y € UU such that 


ApAX=0 and AGNY =¢%. 


From this it would follow that X MY = O, a contradiction. 

Conversely, let F be a finer filter than U. Let Ao € F. It is impossible that 
Ap € U, since this would imply Ag € F, a contradiction. Hence Ap € U and 
FH=U. 


1See Bourbaki, General Topology, 1.6.4. 
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1.3.5 PROOF OF STONE’S THEOREM. Let 2) be the space of ultrafilters on the 
Boolean algebra B. 
Let a mapping ¢ from B into P(Q) be defined by setting 


p(A)={UEN:AEU}, AEB. 


If A, > Ag and Ag € U, then A; € U; hence y is compatible with the order 
relations, and is thus a Boolean algebra homomorphism. Let us show that is 
injective. Suppose that A # B; then either AN B® # @ or A°N B # G. Suppose, 
for example, that AM B° 4 @, and consider the filter 


F={X €B:X>ANB‘}. 


Let U/ be a finer ultrafilter than F. Then U € y(A) and U ¢ y(B). 

To endow 2 with a topology, consider Q) = 2B , the product of infinitely many 
sets of two elements with the factors indexed by the set B. Then 9; is the product 
of compact spaces and hence is compact. Let a mapping ® : 2 — 9; be defined 
by setting 

®(UY) = {1y/(A)} ace, 
where 1,/(A) = 1 if A € U and is zero otherwise. ® is clearly injective; thus Q 
can be identified with a subset of 2;. We now prove that 


1.3.6 ®(Q) is a closed subset of Qi. 


PROOF OF 1.3.6. Let Q; be identified with the set of functions f defined on B 
and with values in {0,1}. We will need the following lemma. 


1.3.7 Lemma. f € 6(Q) if and only if the following conditions are satisfied for 
any A, A’, A”, A” € B: 


f@) = 90, 

f(A) < f(A) if ASA! 

f(A” nN A”) = inf( f(A”), f(A”), 
f(A)t f(A) = 1. 


PROOF. The first three conditions simply restate the fact that ®(U) is a filter, 
and the fourth that UY is an ultrafilter. 
Now let 


La={feM: f(A) + f(A) = 1}. 
Then Ly is a closed subset of Q,, and Nacgla is a closed subset of 1. 
Proceeding similarly with the other conditions of 1.3.7 completes the proof 
of 1.3.6. O 
With the topology induced by 21, ®(Q) is compact; pulling back this 


topology makes 2 a compact space. 
Fix Ap € B and define fo(U) = 1z4(Ao). Then 


y(Ao) = {U EM: fo) = 1}. 


Since fo is continuous, y(Apo) is a closed subset of 2. But (y(Ao))° = y(A§) 
is also closed, so y(Apg) is an open and closed subset of 0. This completes 
the proof of Stone’s theorem. 0 
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1.4 Probability spaces 
1.4.1 Definitions 


A probability space is a measure space (X,.A, 4) for which the measure jy 
has total mass 1: u(X) = 1. 

Following the usual practice in this field, we denote X by Q and ys by P. 
Thus a probability space is written in the form (Q,.A, P). 

A measurable set A € A is sometimes called an event. The measure of the 
measurable set A is called the probability of A and written P(A). Clearly 
0 < P(A) <1. 

P is called the probability measure. 

A property that is true a.e. on Q is called an almost sure (or a.s.) property. 


1.4.2 Transporting a probability measure 


Let (Q,.A. P) be a probability space and let (Y,B) be a measurable space. 
Let ® be a measurable mapping from 2 to Y: 


® € M((Q, A); (Y, B)). 
Then a probability measure P, is defined on (Y, 8) by setting 
1.4.3 P,(B) = P(®-1(B)). 


Axioms I-1.0.1 to 1.0.3 are easily verified. Moreover, P)(Y) = P(Q) = 1. 
P, is written 


1.4.4 P, = ©,(P) 


and called the direct image, or simply the image, of the probability measure 
P under the mapping ®. (®, P is sometimes called the measure induced by 
® on Y.) 


1.4.5 Proposition. Let (Q,A,P) be a probability space, let (Y,B) be a 
measurable space, and let ®, ®' € M((Q,A));(Y,B)). If O(w) = ®'(w) 
a.s., then ®,P = ®P. 


Proor. Let Ag = {w € 1: O(w) £ ®’(w)}. 

Then P(Ap) = 0 and P(A) = P(AN Af) VAE A. 

In particular, P(®~!(B)) = P(®~!(B) As) for any B € B. Ifw é€ 
®~-1(B) NM Ao, then ®/(w) = ®(w) € B, whence 6-1(B) N Af c (®’)-1(B), 
or 

P(®""(B)) < P((®’)"*(B)). 

Since the argument is symmetric in ® and ®’, the opposite inequality 

also holds. 


1.4.6 Corollary. The direct image ®,P depends only on the equivalence 
class of ® in Mp((Q, A); (Y, B)). 
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1.4.7 REMARK. In Chapter I, we never found it necessary to change the 
measure space, which was fixed once and for all. In probability theory, 
however, two operations will play a fundamental role: 


(i) transporting a probability by a measurable mapping; and 
(ii) restricting a probability to a sub-o-algebra . 


1.5 Morphisms of probability spaces 


1.5.1 Definition. Let (0,.A, P) and (9’,.A’, P’) be probability spaces and 
let 
& € Mp((Q, A); (9’, A’)). 


If 6,P = P’, ® is called a morphism of probability spaces and is said to 
preserve probabilities. 


1.5.2 The inverse image operation 


Let 6 € M((Q, A); (’,.A’)) and let (Y,B) be a measurable space. With 
ul € M((0’, A’); (Y, B)) 
we associate ®*u’, its inverse image under ®, defined by 


(O*u')(w) = (u! 0 B)(w). 


Then 
(*u') € M((Q, A); (¥, B)). 


(®*u’ is sometimes called the pullback of u’.) 
If we also assume that (,.A) and (’,.A’) are equipped with probability 
measures P and P’ and that ® is a morphism of probability spaces, then 


(i) The equivalence class of (®*u') in M,((Q, A); (Y,B)) depends only on 
the class of u! in Mp:((’, A’); (Y, B)). 


Let u’, uy, € M((9’, A’); (Y, B)) and set 


(ii) A = {w : Ae ‘\(w) # (@*uh)(w)} and A’ = {w’ : ul(w') # uy(w')}. 
Then A= @71(A’). 
P(A) = P(A’) = 0 since P’ = ®,P. 

By abuse of language, ®* will denote the inverse image mapping induced 
by ® between the spaces Mp and Mp:. 


(iii) Let ©, ®y E M((Q, A); (Q’, A’)) and suppose that ® = ®, a.s. Then ®* 


and &* define the same mapping from Mp:((9', A’); (Y,B)) to Mp((Q, A); 
(Y, B)). 
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If not, there would exist u’ € Mp: such that 
A={w: @*u' A®ju'} and P(A) >0. 
Let 
Ay = {wE 2: Ow) 4 Oi (w)}. 


Then A C A; and P(A;) = 0. But this implies that P(A) = 0, a contra- 
diction. 0 


(iv) Functoriality. Let ®; = ®2 0). Then (®3), = (®2), 0 (®1). and 
@3 = O70 OS. 

The proof is trivial. It suffices to recall that the composition of inverse 
images occurs in the opposite order to that of mappings. 


1.5.3 Injectivity proposition. Let © be a morphism of the probability 
space (Q,A,P) into (Q',A’,P’) and let (Y,B) be an arbitrary measure 
space. Then ®* defines an injective mapping 


Mp.((®, A); (¥,B)) > Mp((Q, A); (¥,B)). 
Proor. Let wu’, ui € Mp-((Q, A’); (Y. B)). Define u = O*u’, uy = O*ul, 
A={wruA wm}, and Av =dw! ss wh 


Then ®-!(4’) = (A) by 1.5.2(ii), whence P(A’) > 03 P(A) > 0.0 


1.5.4 Dynkin’s theorem (Measurability and functional depen- 
dence). Let (Q.A,P) and (Q',A’, P’) be two probability spaces, let ® 
be a morphism from the first to the second, and let B = ®~1(A’). Then 
u € LZ(Q,.A) can be written in the form 


(i) u=uo, with u’ € LY,,(0',A’) 
if and only if the class of u contains a B-measurable function. 


Proof. The forward implication is clear. Conversely, suppose that wu is 
B-measurable. Then, by I-6.4.2, there exists a sequence {f,,} of simple B- 
measurable functions that converges pointwise to u. If B € B, then there 
exists A’ € A’ such that B = 6—1(A’); hence 1g = ©*1y. 

This implies that every simple B-measurable function satisfies (i). Hence 
fn = ul, o ®, with ul, € L° (O!, A’). 


(ii) We show that u!, converges a.s. on 1. 
If not, there would exist « > 0 and A’ € A’, with P’(A’) > 0, such that 


sup |uj,(w)—ui,(w)|>«, Vp Wwe A’. 


mn>p 


Then u, would satisfy the same inequality on ®~!(A’). But this would 
contradict the a.s. convergence of f,,, since P(®~1(A’)) = P’(A‘) > 0. 
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Thus (ii) is proved. Let u’ = limu!, € L%,(, A’); then u = lim f, = 
uo®.0 
1.5.5 Corollary. Let ® be a probability space morphism from (Q, A, P) to 
(9, A’, P’) and let B = ®71(.A’). Using ®*, one can identify LE, (0', A’) 
with the subalgebra of L¥,(Q,.A) consisting of the B-measurable functions. 


PRooF. By 1.5.3 and 1.5.4. 


1.6 Random variables and distributions of random variables 


1.6.1 Definition. Given a probability space (Q,A,P), a random variable 
X is a class of measurable functions, that is an element of L%(,A). We 
will often write simply r.v. 


1.6.2 Definition. The distribution of the random variable X is the direct 
image of P under X. 
Thus X,P is a Borel measure on R of total mass 1. Hence, by I-3.1, 


(i) (X,P) defines a Radon measure of total mass 1. 


1.6.3 Definition. Given a finite set X1,..., X, of r.v. defined on the proba- 
bility space (Q, A, P), their joint distribution is the direct image of P under 
the mapping ® : w > R* defined by the Xp(w), L<p<k. 
It follows from I-2.4.2 and I-2.3.5 that ® € M((Q,.A); (R*, Br,)). 
Hence ©, P is a finite Borel measure on R¥ and, by II-3.1, 


(x) ®,P defines a Radon measure on R* of total mass 1. 


1.6.4 Let p; be the projection of an element of R* onto its first component, 
let p be the joint distribution of X;,...,X,, and let i be the distribution 
of X;. Then py = (p1)«fe- 

This follows from functoriality, 1.5.2(iv). 


1.7 Mathematical expectation and distributions 
1.7.0 Notation for expectations 


Let (9,.A,P) be a probability space and let X € Lp(,A). Then the 
mathematical expectation of X is written E(X) and defined by 


E(X) = / X(w)dP(w). 


The reader should note that the measure P, and the probability space 
Q itself, are implicit in the notation E. 
In this notation, the DL? norm is written 


[E(IY|9))/4 = [I¥ ze. 
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1.7.1 Change of variables 


Let (2,4, P) and (', A’, P’) be probability spaces, let & be a morphism 
from the first space to the second, and let ®* : L°(0’, A’) + L°(Q, A) be 
as defined in 1.5.3. 


Proposition. Let u’ € Lp, (0, A’). Then u = (®*u’) € LL(Q,A) and 
E(u) = E(u’). 


PROOF. Suppose that u’ is a simple function, say u’ = > ap14,. Then 
u= So ard, where A, = 67'!(A}). 


By 1.4.3, P(A) = P’(A;,), whence E(u) = E(w’). 

Let v’ € Ly,; then there exists a sequence {u/,} of simple functions such 
that 

B(\v! — u4) = Ilo! — ui llzs, 0. 

Let un = ®*u),. Then 

lun — Umllo, = E(lun — Um|) = E(\u;, — uj,|) +0 as m,n — too. 

Thus {u,} is a Cauchy sequence in Lp. Let v be its limit; then v € Lb. 
There exists a subsequence {uj}, : n € o} of {u!,} that converges a.e. on 
0’, Similarly, there exists a subsequence {u, :n € T} of {un :n € o} that 


converges a.e. on 2 to v. Then the relation up, = uj, o ® passes to the limit, 
and v = v’ o ®. Moreover, since 


E(v) =limE(u,) and E(v’) = lim E(u/,), 
the fact that E(u,) = E(u.) implies that 
E(v) = E(v’).0 


1.7.2 Computing expectations by means of distributions 


Let (Q,.A, P) be a probability space and let X,,...,X, be a finite set of r.v. 
defined on 9. Let be the Radon measure on R* that is the distribution 
of X1,..., Xp. 


Proposition. Let y € Lj, and let Y(w) = y(Xi(w),...,X(w)). Then 
Y € Li and E(Y) -{ pds. 
R* 


PROOF. By 1.7.1. 


1.8 Various notions of convergence in probability theory 


This section consists of two subsections. In the first, we introduce the vocab- 
ulary used in probability theory to study concepts that are already familiar. 
In the second, we study the new concept of convergence in distribution. 
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1.8.1 Vocabulary of probability theory 


Let {X,,} be a sequence of r.v. defined on the probability space (Q,.A, P), 
and let Y be another r.v. defined on the same probability space. 


Definitions 


(i) X, converges to Y almost surely (abbreviated a.s.) if X,(w) converges 
a.e. to Y(w). 


(ii) X,, converges to Y in mean if 


|Xn—-Y jz. 70, or E(|\X,-—Y|) > 0. 


(iii) X, converges to Y in mean square if 


|Xn—-Yllz2 70, or E(|X,-Y|?)-0. 


(iv) X, converges to Y in probability if Xn converges to Y in measure. 
(v) The relations among these different kinds of convergence were studied 
in Chapter I. 

1.8.2 Convergence in distribution 


Let (Qn,An, Pp) be a sequence of probability spaces and let (9’, A’, P’) be 
another probability space. 

Let X, € L°(Qn,An, Pn) and Y € L°(0', A’, P’) be given. We say that 
the sequence of distributions of X,;, converges to the distribution of Y if, 
writing 


(i) (Xn)ePn =n and Y,P’ =V 


for the respective distributions, 

(ii) Un converges narrowly to v. 

A sequence f4, such that 

(iii) fn converges narrowly 

is commonly, though rather ambiguously, described by saying that 


(iv) the r.v. Xp converge in distribution. 


1.8.3. Criterion for convergence in distribution 


Theorem. The r.v. X,, converge in distribution to the distribution of Y if 
and only if 


(7) limE(y(Xn)) =E(y(Y)), Ve € Cx(R). 
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Proor. By 1.7.2, in the notation of 1.8.2(i), 


E(y(X,)) = [ eam and E(y(Y)) = I pay. 


Thus 
[tin > fredn. Ye € Ce(R). 

That is, 

(ii) fn converges vaguely to v. 

By 1.6.2(i), 
fin(R) =1 and v(R)=1; 

hence lim ,(R.) = v(R), and II-6.8 shows that (ii) is equivalent to narrow 
convergence. O 


1.8.4 Extension to r.v. with values in R” 


An ordered m-tuple of r.v. X!,...,.X is called an r.v. with values in R”, 
or an R”-valued r.v. Such a r.v. is sometimes denoted by X € Mp((Q, A); 
(R”. Ban)). 


Given ar.v. X with values in R", its distribution is the joint distribution 
of the X* considered in 1.6.3; it is thus a Radon measure on R”™. 

A sequence of r.v. with values in R’, say xX, ae ae is said to 
converge in distribution to Xo if the sequence of distributions converges 
narrowly to that of Xs We have the following propositions. 


(i) The sequence of r.v. X,, with values in R™ converges to the distribution 
of Xo if and only if 
lim E(y(X,)) = E(y(Xo)), Ve € Cx(R™). 
In this criterion, a compactly supported y can be replaced by a bounded 


continuous y. The next statement results from letting w be a function that 
depends only on the first coordinate of R™ and applying 1.8.3. 


(ii) Jf ie converges in distribution to Rey then each component xe con- 
verges in distribution to X}. 


The converse of this statement is false. 
1.8.5 Comparison of convergence in distribution 
with other types of convergence 
Proposition. 


(i) A.s. convergence implies convergence in distribution. 
(ti) Convergence in probability implies convergence in distribution. 
(iti) Convergence in L? implies convergence in distribution. 
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Proor. Let the probability space 2 be fixed and let X,, Y € L°(Q, A, P). 
Assume first that X,, converges a.s. to Y. Then Vp € Cx(R), 9(Xn(w)) 
converges a.s. to y(Y(w). Since y is bounded, Lebesgue’s dominated con- 
vergence theorem can be applied to show that E(y(X,,)) ~ E(y(Y)). This, 
with 1.8.3, gives (i). 

Assume now that X, converges in probability to Y. By I-5.2.7, every 
subsequence {Xn}neo itself contains a subsequence {X,}neo such that 
{Xn}neo: converges a.s. Hence, if y € Cx(R), it follows from (i) that 


jim E(y(Xn)) = E(y(¥)). 

Let B, = E(y(X,)) and let y = E(y(Y)). Then every subsequence 
{Bn}neo of {8,} contains a subsequence {Bn}neo that converges to y¥. 
This implies that lim G,, = y, and (ii) now follows from 1.8.3. 

Finally, by I-9.3.1, convergence in L? implies convergence in probability; 
thus (iii) follows from (ii). 0 


2 Conditional Expectation 


2.0 Phenomenological meaning 


We now resume the discussion of the principles of probability theory begun 
in 1.1. 

From the phenomenological point of view, the set of all measurements 
an experimenter can possibly make on a physical system is represented by 
a Boolean algebra B. The physicist is interested in exhibiting the “laws of 
nature” in the context of B; given certain measurements, he would like to 
predict the values of others. 

There are two kinds of predictions. The first involves a functional depen- 
dence. For example, in Ohm’s law (that V = RI), the measurement of two 
quantities completely determines the third. The second involves a “corre- 
lation” without necessity; for example, a substantial drop in barometric 
pressure makes it “likely” that a cyclone is approaching. 

The experimenter represents the known information about the physical 
system by a subalgebra B’ of B. Given a physical quantity X, he asks himself 
the following questions. 


(a) Is X determined by the information B’? That is, in terms of 1.5.4, is X 
measurable with respect to the o-algebra generated by B’? 


(b) If not, the experimenter will try to extract from the information B’ 
all it implies about X. What is the most likely value of X? Does he risk 
making a major error by taking this most likely value as the value of X? 
And so on. 
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Passing to o-algebras generated by Boolean algebras allows the problem 
to be posed as follows: 

Given a probability space (Q,.A, P), a sub-c-algebra A’ of A, and X € 
L°(Q,.A, P), can X be approximated by Y € L°(Q, A’, P)? (We abuse lan- 
guage by writing P for the restriction of P to A’.) 

In the next section, we will try to solve this problem by using an ap- 


proximation that minimizes the L? norm, i.e. an orthogonal projection on 
ioe 


2.1 Conditional expectation as a projection operator on L? 


Let (Q,A,P) be a probability space and let B be a sub-o-algebra of A. 
L?(Q, B, P) is abbreviated as L?(B), and so on. 


2.1.1 Lemma. Let 1 < p < +00. Then L?(B) can be identified with a 
closed vector subspace of L?(A). 


Proor. A B-measurable function is A-measurable: £L°(B) C £L°(Q, A). The 
same holds for simple functions: E(B) C €(.A). Since the probability mea- 
sure on B is the restriction of that on A, the integral on the integrable 
simple functions E'(B) is given by restriction of the integral defined on 
E'(A). Endowing E'(B) with the norm || ||», we obtain an isometric 
mapping from E!(B) to E'(A). 

Since E(B) is dense in L?(B) and L”(Q, A, P) is complete, this isometry 
extends to an isometry 

L?(B) — L(A). 


The image of a complete space under an isometry is complete; hence the 
image of L?(B) is complete and, in particular, closed in L?(A). 


2.1.2 Definition. EB denotes the orthogonal projection operator from 
L?(A) onto L?(B). Given f € L?(A), EF (f) is called the conditional ex- 
pectation of f given B. 


2.1.3 Theorem (Properties of the conditional expectation). 


(i) BB (f) € 1(B) 
(ii) WEP (Ace < Weilae. 


Let B and C be sub-c-algebras of A such that B > C. Then 
(idi) ECRE = Bf and 
(iv) E(E®) = E. 
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(v) Let y € L®(B). Then EB (yf) = pEB(f), vf € L(A). 


Proor. Properties (i) and (ii) follow from properties of the orthogonal 
projection. 

The inclusion between o-algebras B D C implies, for functions, that 
L?(B) D> L?(C). 

Let f € L?(A) be decomposed as 


f=utzy, with v € (L7(B))+ andu= EB (f). 


Then u = wth, with h € (L?(C))+ and w = EC (u). Substituting this into 
the last line gives 


(vt) f=wt+(h+v). 
By definition, w € L?(C), and since 
L*(B) > L*(C) = (L7(B))~ c (L°(€))~, 


v € (L7(C))+. Hence h + v € (L7(C))+. The decomposition (vi) implies 
that w = EC ( f). Thus (iii) is proved. 

Let Ap denote the coarse o-algebra containing only the two sets Q and 
@. A function y is Ao-measurable if and only if it is constant. (L?(Apo))+ 
consists of the functions with zero expectation. Any function f € L? can 
be written as 


f=E(f)lo+h, where E(h) =0, 
and thus 
(vit) B45(f) = E(f)1o. 


By abuse of language, we identify the conditional expectation relative to 
Apo with the expectation. Then (iv) becomes a special case of (iii). 

It remains to prove (v). Let M, denote the bounded operator defined on 
L?(A) by multiplication by y. Thus M, : f + yf. Since y € L°(B) and 
L°(B) is an algebra, 

M,(L7(B)) c L?(B). 


Note that M, is a hermitian operator, that is, 


(My(fylg)z2 = (f|My(9))r2- 


This is just a restatement of the fact that 


E((yf)g) =E(f(vg)), Vig € L(A). 
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Since L?(B) is invariant under the hermitian operator M,, its orthogonal 
complement is invariant under M,. Thus, if f = u+v with u € L?(B) and 
v € (L?(B))+, then 


M,f = M,u+ Mgv, where (M,u) € L?(B), (M,v) € L?(B)+. 


That is, EP (MLS) = M,E® (f), and (v) is proved. O 


2.2 Conditional expectation and positivity 


2.2.1 Proposition. Let f € L*(A), f > 0, and let B be a sub-c-algebra of 
A. Then E®(f) > 0. 


Proor. Let B € B. Then, by (v), 
E(EF(f)1z) = E(E9(f12)) = E(f1z) > 0, 


where the second equality follows from (iv). Setting v = EB f. we have just 
shown that 


(i) E(vlg)>0, VBEB. 


Let By, = {w:v(w) < —n7"}. Since v € L°(B), B, € B; it follows from (i) 
that 
E(v1z, ) > 0. 


Moreover, E(v1lg,) < —n~!P(B,,). Hence P(B,) = 0 for all n, and thus 
P(UB,,) = lim P(B,,) = 0. 0 


2.2.2 Corollary. Let f, g € L?(A). Then 


(i) fogs Ef) > BF (9) 
and 
(ii) |E8(f)| < BB ((f)). 


PROOF. Since f — g > 0, we have EF (f — 9) > 0. Furthermore, —|f| < f < 
| f| implies (ii). 


2.3 Extension of conditional expectation to L! 


Theorem. The operator EB defined on L?(A) in 2.1 has a continuous 


extension EP, defined on L'(A) and with values in L'(B). This extension 
has the following properties: 


(i) eBip) =f forevery f €L*(B). 
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(i) EB llo < Wille. 
(iit) If BDC, then el eB _ ef. in particular, Ech = B. 
(iv) If p € L®(B), then EF pf) = vE®(f). 


PRoor. Let f € L?. Then |BB ¢| < B3(/f\) by 2.2.2(ii), and hence E(\E® f|) 
< B(E8(\f|)). It follows from 2.1.3(iv) that 


B(B5 (| f))) = E((fl) = Wile. 
That is, 


(v) WEF Flin <Iflla, VF © L2(A). 


Thus EP is a bounded operator when L?(.A) is equipped with the L1 norm. 
Since L1(B) is complete and L?(.A) is dense in L(A), EP can be extended 
to an operator from L(A) to L1(B). This extension is denoted by eB. 
Since EF (f) = f if f € L?(B) and since L?(B) is dense in L1(B), the 
operator extended by continuity has the same property; this implies (i). 
Assertion (ii) follows from (v). 
(iii) and (iv) are obtained from 2.1.3(iii), (iv) and (v), which we extend 
by continuity. 0 
(vi) ABUSE OF LANGUAGE. From now on we use the same notation, namely 
EP for both €8 and BB. 


2.4 Calculating EP when B is a finite o-algebra 


Let B be a finite sub-o-algebra of A and let e€),...,e, be the atoms of B 
with strictly positive probability. 


2.4.1 Proposition. EP (f) = Doagle,, where ap = Pre, (flex): 


PROOF. Since EF (f) € L°(B), it suffices to check that f — EP (Pf) is or- 
thogonal to L°(B). Since the 1,, form a basis for L°(B), it suffices to show 
that 


E((f — EB (f))1.,) =0. 
But 
E(f1e,) = asE(1-,) = as(P(es)).0 


2.4.2 Definition. Let a measure zy, be defined on A by setting 


pa(A) = Fay Panen) 


Note that ,(Q) = 1. 
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Lx, is called the conditional probability given the atom ex. With the no- 


tation of 2.4.1, 
a, = [fais 


2.4.3 Proposition. Let B be a finite o-algebra of A, let p be a conver 
function, and let f € L'(A), f > 0. Then 


y(EB(f)) < EF (y(f)). 


PrRoorFr. Retaining the notation of 2.4.1 and letting uw, denote the condi- 
tional probabilities, we have 


EF (y(f)) = SBele,. where By =f e(f)dux, and 
e(BF(f)) = Sylax)le,, where ax =f dp. 


Since yx, has total mass 1, Jensen’s inequality (I-9.2.2) can be applied, 
and shows that y(ax) < B,. 0 


2.5 Approximation by finite o-algebras 


2.5.1 Proposition. Let f,,..., fn € L1(A). Then there exists an increas- 
ing sequence B, C...B, Cc... CB of finite o-algebras such that 


NEB fy - BB fiz +0 ask—00, F=1,2..00. 


PROOF. We first consider the case where n = 1, and write f for f,. Let 


u= E®(f); then u € L1(B), and hence u is the limit in L1 of a sequence 
{ux} of simple functions in L'(B). Let By, be the o-algebra generated by 


the us, 8s < k; then By C B and uy is By-measurable. EB« (up) = Up and 
EP* (uz, — u)|| < lug — ullza, whence 


EB * (aw) — BB+ (up)|[o2 + JEB* (ux) — ull 
EF" (u — we)Ier + (lux — wll < 2llu— well. 


EP au — ula 


IA IA 


Moreover, EP, = BB: (BP Ff) = EBs (f) by 2.3(iii). Thus 
JEP f - EP flip. < 2IJu — fll 0. 


This ends the proof for n = 1. 


The general case is treated by induction on n. Letting {Bi} be a sequence of 
finite o-algebras adapted to fo,..., fn, we take BZ to be the o-algebra generated 
by B;, and By. 0 
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2.5.2 Corollary (Jensen’s inequality). Let B be a sub-o-algebra of A, 
let f € L'(A), f > 0, and let y be a nonnegative convex function such that 
E((~(f)]) < +00. Then 


o(EB(f)) < BB (y(f). 


PROOF. By 2.5.1, there exists a sequence of finite o-algebras B, such that 
EOF — fl) 20 and EF (p(f)) - v(f)Iln = 0. 


By 2.4.3, o(EP*(f)) < BF (y(f)), or BB*(y(f)) — o(BF*(f)) > 0. 
Since L1 convergence preserves positivity, 


B5 (pf) — (BB (f)) > 0.0 


2.6 Conditional expectation and L” spaces 


Let 1 < p< +o. Then, since L?(A) c L!(A), the conditional expectation 
operator EB is defined on L? (A). 


2.6.1 Proposition. Let 1 < p < +oo. If f € L?(A), then EF (f) € L?(B) 
and 


(i) WEF (Pie < [flee 

Let p and q be conjugate exponents. Then 

(ii) BB (fg) = gEP(f), vf L(A), 9 € L%(B). 

(iii) 

E((B39)(B8 ¢)) = E(gEF(f)) = E(fER (9), Vf € L(A), 9 € L(A). 


Proor. If 1 < p < +00, the function y(t) = t?, t > 0, is convex. Hence (i) 
follows (except when p = co) from 2.5.2 (Jensen’s inequality). 

It remains to prove (i) if p = co. Given f € L©, we can find a sequence 
B, of finite sub-c-algebras such that 


BB ¢ — BF FI. 0. 


Using the expressions given in 2.4.1 and 2.4.2, 


[ta 


Let uz, be a subsequence of vu, = EB: f such that uz, EB 5 a.s. Then, 
since 


EF f\|,~ = sup 


< ||f ll ze. 


lve.) S Illa, 
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(i) holds for p = co. 
Note that (ii) holds for bounded functions by 2.3(iv). Using the truncation 
operator (I-6.7), we can find sequences f, € D°°(A) and gn € L™(B) such that 


Ilfn — fil» 0 (whence, by (i). EB f, — BF fll» 0) and llgn — gllnx = 0. 
Hence, by 2.3(iv), EB (fgn) = gnEB (fa). 
Since || fngn — fg||_1 — 0 by Holder’s inequality, EB (fn) —EB (fg) II = 0. 
Similarly, gnE® fy, converges to gE® f in L', and (ii) follows. 


When f, g € L?(A), we consider the scalar product 
(flg)i2 = E(fg). 
By the properties of the orthogonal projection, 
(B8 f\g) = (f1B% 9) = (BP F/B 9). 


Since L® Cc L?, this proves (iii) for the special case where f, g € L*(A). 
The general case is proved by using the truncation operator as above. 


3 Independence and Orthogonality 


3.0 Independence of two sub-o-algebras 


3.0.1 Definition. Let 6B and C be two sub-o-algebras of the probability 
space (Q,.A, P). B and C are said to be independent (relative to P) if L?(B) 
and L?(C) are orthogonal on the constant functions; that is, if 


f €L*(B), g € L*(C), and E(f) = E(g) = 0 imply E(fg) = 0. 


REMARKS. 


(i) The notion of independence involves the L? norm, and thus the proba- 
bility measure P. To be precise, we should speak of independence relative 
to P. Since we have considered P as given once and for all, by abuse of 
language we say simply independent. 


(ii) Since both L?(B) and L?(C) contain the function 1p, they can never be 
orthogonal; independence corresponds to the strongest notion of orthogo- 
nality that can be expected. 

(iii) Consider the codimension-1 subspace 7 composed of functions orthog- 
onal to the constant functions: 


H = {f ¢ L(A): E(f) = 0}. 


The relation E = EE? implies EB (nH) Cc H. Moreover, 3.0.1 can be 
written as 
HN L?(B) is orthogonal to HM L?(C). 
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(iv) It follows from 3.0.1 that L?(B)/M L?(C) reduces to the constant func- 
tions. 

Since L?(B)N L?(C) = L?(B NC), where BNC is the o-algebra of those 
functions in 2 that belong to both B and C, we conclude that if B and C 
are independent, then BNC reduces to the sets of probability zero and their 
complements. Up to sets of probability zero, BNC is thus equivalent to the 
coarse o-algebra. 


3.0.2. Mutual independence of n sub-o-algebras 


Let By,...,By, be n sub-o-algebras of A, let H be a subset of [0, 1], and let 
By be the o-algebra generated by {B; :i € H}. Then By,...,B, are said 
to be mutually independent if 


By and Bye are independent o-algebras for every H € P([0,1)). 


3.1 Independence of random variables and of o-algebras 


(i) Let (Q,A,P) be a probability space and let B and C be two sub- 
o-algebras that are independent on this space. Let B’ and C’ be two other 
sub-o-algebras such that B’ Cc B and C’ c C. Then 


B' and C’ are independent. 


Indeed, L?(B) NH D> L?(B')NH and L?(C) NH D L?(C') NH. Hence the 
orthogonality of the first pair of subspaces implies the orthogonality of the 
second pair. 


(ii) Let X1,...,X, be mn random variables and let By = X;,' (Br). Then 
Xj 1,...,Xpn are said to be mutually independent if the B, are mutually 
independent o-algebras. 


(iii) Let Dy,...,D, be mutually independent sub-c-algebras of the proba- 
bility space (Q, A, P). Let X; be a D;,-measurable r.v. defined on (Q, A, P). 
Then the r.v. X, are independent. 

This follows from (ii) and the fact that X;'(Br) C Dp. 


(iv) Stability of independence under a change of variables. Let X1,...,Xn 
be independent r.v., let y1,...,(@n be Borel functions from R to R, and 
let Y; = pr(Xx). Then the Y; are mutually independent r.v. 

Y, (Br) © X,;'(e;'(Br)) C X;'(Br), where the second inclusion 
holds since y is Borel. (i) now implies the result. 0 


3.2 Expectation of a product of independent r.v. 


3.2.1 Theorem. Let B andC be two sub-c-algebras of the probability space 
(Q,A,P). Then the following statements are equivalent: 
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(i) B and C are independent. 
(i) E(fg) =E(f)E(g) Wf € L°(B), g € L*(C). 


PROOF. Decompose f and g as f =u+E(f)1lg and g =v+E(g)1q. Then 
u€ HN L?(B) and v € HN L7(C). Moreover, 


(iii) E(fg) = E(uv) + B(f)E(g). 


since E(u 1g) = 0 and E(w 1g) = 0 if u, v € H. In view of (iii), (ii) is 
equivalent to 


E(uv) =0, Vue L?(B)NH, v € L2(C)NH:; 


that is, to the orthogonality of L7(B)H and L?7(C) NV H.0 


3.2.2 Proposition. Let B,,...,6,, be mutually independent sub-o-algebras 
of the probability space (Q,A,P). If f; € L~(B;), i =1,...,n, then 


E (11 i) = [J BW) 
i=1 i=1 


REMARK. The converse will be proved in 3.6.1. 


PROOF. We proceed by induction on n. Assume that the theorem has been 
proved for q < n and let [J/_, fi =h. 

Let By denote the c-algebra generated by {f, | (Br) :2<i<n}. Then 
h € L®(By) and, since f7'(BR) and By are independent by 3.0.2, it 
follows from 3.2.1 that 


B(hfi) = E(h)E(f:). 
We conclude by using the induction hypothesis E(h) = |}. E(fi). 


3.2.3 Corollary. Let fy,.... fn € L'(Q,A,P) and let h =[];_, fi. If the 
fi are independent, then 


heL'(Q,A,P) and E(h) = [leW. 


PROOF. We first prove the corollary under the hypothesis f; > 0,i=1,...,n. 
Let T, be the truncation operator. By 3.1(iv), the T,(fi) are independent; 


by 3.2.2, 
E (1 nus) = [[ ea) < TTB) = 


Set uy = [],Tu(fi). Then {u,} is an increasing sequence and E(u,) < M; 
hence Fatou-Beppo Levi implies that limug = h € L' and lim E(uq) = E(h). 
The general case is reduced to this special case by writing 


fi=fP—fi, where f? =f =sup(fi,0) and f) =f; = sup(—fi,0), 
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and expanding the product 


TL = SoCo [Tee 


a 


Since the f*' are nonnegative, 


E (0 i) = |[EU?». 


As the sum of 2” functions in L', [] fi is in L’. 0 


3.3 Conditional expectation and independence 


3.3.1 Theorem. Let (Q,A,P) be a probability space and let B and C be 
two sub-o-algebras. Then the following two statements are equivalent: 


(i) The o-algebras B andC are independent. 
(ii) BB(f)=E(f) vf e D(C). 


REMARK. The roles of B and C can be interchanged for a different formu- 
lation of (ii). 

This statement can be given the following concrete interpretation. If B 
and C are independent, then “knowledge of the events in the o-algebra B” 
in no way improves the “mean value” of a C-measurable r.v. 


PrRooF. (i) = (ii). Assume that f € L?(C). Set 


f=f-E(f)lo. 
Then f € H and 


E°(f) = E“(f) — E(f)lo = f -E(f)lo =f, 
whence f € L?(C)NH. 
By (i), f is orthogonal to L?(B); thus EF (f) = 0, or 


BP (f) = E(f)lo, 
implying (ii). 
When f € L!, we use the truncation operator and pass to the limit. 
(ii) > (i). Let f € L2(C) NH. Then, by (ii), EF(f) = 0. That is, every 
f in L?(C) NH is orthogonal to L?(B), and it follows that B and C are 
independent. 0 


3.3.2 Corollary. Let B and C be independent sub-c-algebras of the proba- 
bility space (Q,A,P). Then 


EFF = B. 
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Proor. Let f € L'(Q,.A, P). Then E& f € £1(C). Set u = E°(f); then 


E(u) = E(E°(f)) = E(f). 


Since u € L1(C), it follows from 3.3.1 that EP (u) = E(u) = E(f). 


3.4 Independence and distributions 
(case of two random variables) 


3.4.1 Theorem. Let X, and Xo be two r.v. defined on the probability space 
(Q,.A, P). Let py and pg denote the distributions of X, and X2, respectively, 
and let 4 denote their joint distribution. Then the following statements are 
equivalent: 


(i) X, and X2 are independent r.v. 
(ii) For all bounded Borel functions 1, po defined on R, 


E(vi(X1)y2(X2)) = B(yi(41))E(y2(X2)). 


(itt) fe = py @ fly. 
Proor. (i) & (ii). Let B; = X;1'(Br). Then the independence of the 
o-algebras B, and Bg is equivalent to that of the r.v. X, and X92. 

Let f; € L?(B;). By 1.5.4, the functional dependence theorem, there exist 
Borel functions 7; : R — R such that 


Hence (i) is equivalent, by 3.2.1, to 
E(w (X1))B(W2(X2)) = E(w (X1)¥2(X2)) 


for all Borel functions y); such that w;(X;) € L?. 

Using the truncation operator shows that this last condition is equivalent 
to the more restrictive condition that y; be a bounded Borel function; that 
is, to (ii). 

(ii) = (iii). Let C, D € Br. Set gy =1¢ and w = 1p. Then, computing 
the expectations by means of the distributions, 


sh lox pd = E(1c(X1)1p(X2)). 


| Ledp = E(1¢(X1)), 
R 


whence, using (ii), 
p(C x D) = pi (C)p2(D). 
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Since p: is a Borel measure on R? and Baz = Br ® Br, this last relation 
shows by IL-8 that pp = p41 @ pe. 

(iii) > (ii). Again using the distributions to compute the expectations, 
we have 


E(p1(X1)p2(X2)) = i: g1(E1)p2(E2)dpe (1) ® dui2(E2). 


By Fubini’s theorem, this is equal to 


[ eedduntes) | val€a)dual a) 


and (ii) is proved. 0 


3.5 A function space on the o-algebra 
generated by two a-algebras 


3.5.1 Theorem. Let B andC be two sub-c-algebras of the probability space 
(Q,A,P) and let D denote the a-algebra they generate. Let V be the vector 
subspace of L(A) defined by 


V= {? EL*(A):h= > figi. with fi © L*(B), gi € i=} . 


i=l 
Then L?(D) > V and V is dense in L?(D). 
PROOF. We prove the theorem in the special case that there exist two 
mappings u: Q — R” and v: 2 — R? such that 
) u'(Brx)=B and v!(Brr) =C. 


Let w: Q > R"*? be defined by w(w) = (u(w), v(w)). Then w7! (Brant) 
is a g-algebra containing B and C. 
Moreover, by J-2.4.2, 


Brot _ Bron @ Brp. 


Hence Ban+p is generated by the rectangles R= X x Y, with X € Bro, 
Y € Bre. We have 
w!(R) {w:u(w) € X and v(w) € Y} 
u(X)nv(Y). 


That is, w~!(R) € D. With the hypothesis (i), we have thus shown that 


(ii) D = wo (Bans): 
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Let p be the distribution of w and let w* be the inverse image mapping. 
Then it follows from 1.7.2 that 


(tit) w*: L°(R"'P, p) > L?(D), 


and the mapping is a surjective isometry. 

The continuous functions with compact support, Cx(R”*”), are dense 
in L?(R"*?; p). (See II-3.) 

Let y € Cx (R"*”). Then, by the Stone-Weierstrass theorem, there exists 
a sequence of polynomials P, converging uniformly to y on a compact set 
kk, x K2 which contains the support of y. Let g, = P,1x,.«,- Then 


(iv) l9r — vllz2(p) 9. 
We now show that 
(v) w* (gr) € V. 
This follows since the polynomial P, is the sum of monomials of the form 
(ul) 2 (ur)™ (0, )% 2. (uP) e 
and, setting 
f=1xn,(u')™...(u)™ and = g =1x,(v)™... (u?)%, 


we can write w*(g,) as a linear combination of functions of the form fag. 
Thus (v) holds. 

Since w* is an isometry, w*(Cx(R"*)) is dense in L?(D), and to the 
convergence of g, to y in L?(p) there corresponds a convergence in L?(D). 


(vi) REMARK. To prove the theorem without the hypothesis (i), we would con- 
sider finite systems of B-measurable functions w1,..., Un and C-measurable func- 
tions 1,...,Uqg. Then B could be viewed as the o-algebra generated by all the 
u'(BR»), and similarly for C. We would then “pass to the limit”. This passage 
to the limit will be carried out in detail for closely related cases in Section 6 of 
this chapter. 


3.5.2 Corollary. Let B,,...,Bn be a finite collection of sub-o-algebras of 
the probability space (Q,A,P) and let D be the o-algebra generated by 
By,...,By. Set 


q 

Wi = {» €L™(A):h= So fif2... ft, with fie va) 
p=l1 

Then W,, © L?(D) and W,, is dense in L?(D). 


PROOF. We proceed by induction on n. Let C be the o-algebra generated by 
Bz,...,B,. Then, by the induction hypothesis, 


(2) Wn-1 is dense in L?(C). 
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The o-algebra generated by Bi,...,Bn equals the o-algebra generated by B; 
and C. Let 


V={h:h=)° figi, with fi € L*(Bi), gi € L(C)}. 
Then, by 3.5.2, V is dense in L?(D). Let 
(ii) ‘= {h:h=)~ figi, where fi € L*(Bi), gi € L?(C)}. 
Then V’ Cc L?(D), and V’ is dense in L?(D) since V’ D> V. By (i), each g; can 


be approximated by elements of W,,-1. Hence there exists a sequence k? € W,-1 
such that ||k? — gi||,2 > 0, and 


IS > fig: — So fai lln2 < So Wille Ilo — Bille. 


The right-hand side tends to zero, and we conclude by noting that )> fik? € Wn. 


3.6 Independence and distributions 
(case of n random variables) 


Theorem. Let B),...,B, be n sub-c-algebras of the probability space 
(Q,A,P). Then the following statements are equivalent: 


(i) By,...,By are mutually independent. 
(i) E((UjL, fi) = 1 Eh) for any fi € L (Bi). 


ProoF. Recall that the direction (i) => (ii) was proved in 3.2.2. We now 
prove that (ii) > (i). Let H be a subset of {1,...,n}, let H’ be the 
complement of H, and let C and C’ denote the o-algebras generated by 
{B,: i € H} and {B; : j € H’}, respectively. We must prove the indepen- 
dence of C and C’. By 3.2.1, this will follow from the identity 


(iti) E(gg') = E(g)E(g’) Vg € L7(C), g' € L?(C’). 


Using (ii), the function space constructed in 3.5.2 on the a-algebra gener- 
ated by C and C’, and bilinearity, it suffices to calculate 


E([[f [[f] =[[L£v) I] £”. 


i€¢H jEH’ t€H jE€H’ 


Using (ii) again and setting f’ = 1 if i € H’, we find that the first term on 
the right-hand side is E(g), and similarly the second is E(g’). This proves 
(iii). O 
3.6.2 Theorem. Let X1,...,Xn ben r.v. on the probability space (Q, A, P). 
Then the following statements are equivalent: 
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(i) The r.v. X, are mutually independent. 
(ti) For all bounded Borel functions pp on R, 


n mh 
E (11 eu(%i)) = |] Ever (X,)). 
ke 
(iti) Let fs denote the joint distribution of X,.... X, and let fa; 
denote the distribution of X;. Then 
WA x Ag x... x An) = [J ai(Ai) 
i=1 


for any Aj € Br. In other words, fp = @1;. 


PROOF. The theorem was proved in 3.4.1 for n = 2. The general case is 
proved in the same way. with Theorem 3.2.1 replaced by Theorem 3.6.1. 0 


4 Characteristic Functions and Theorems 
on Convergence in Distribution 


4.1 The characteristic function of a random variable 


Let (Q,.A, P) be a probability space on which the R”-valued r.v. 


is defined. The characteristic function of the r.v. X is the function defined 
on R” = {(t,..., ty) } by 


px (t1, t2,...,tn) = E(exp[i(t: X1 + t2X?74+...+4,X™)]), 


where 7 = /—1. Since the imaginary exponential is a function with modulus 
1, the expectation of the right-hand side exists for every t € R”. 
4.1.1 Determining the distribution from its characteristic function 
Proposition. Let (Q,A,P) and (Q’, A’, P’) be probability spaces and let 
X and X’ be R"-valued r.v. Then statements (i) and (ti) are equivalent. 
(i) px(t)=px(t), VteR". 
(it) X and X’ have the same distribution. 


PrRooF. Let ys and py’ be the distributions of X and X’. 
Calculating the expectations by means of the distributions, we obtain 


exit) = | e“du(a) and ext) =f edu (x). 


n 
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(iit) px(t) = A(t), 
where jz denotes the Fourier transform of p. 
But it was shown in III-2.6 that two measures with the same Fourier 
transform coincide. O 
4.1.2 Convergence in distribution and characteristic functions 


Theorem (Paul Lévy). Let {X,} be a sequence of R"-valued r.v. defined 
on different probability spaces. Then the following statements are equiva- 
lent: 


(i) {Xp} converges in distribution. 
(ii) The functions px, (t) converge uniformly on compact sets. 


Moreover, if (ti) holds, let 
w(t) = lim yx, (t). 


Then there exists a positive Radon measure v of total mass 1 on R” such 
that v(t) = w(t) and the distributions of the Xp converge to v. 


PROOF. We first prove that (ii) = (i). Let yz» denote the distribution of 
Xp. 


(iit) Vague convergence of the Ly. 


Consider the linear functionals 1, on Co(R”) defined by 


Ip(u) = / Udfy.- 


Then |/,(w)| < ||ul|co. Moreover, by Parseval’s lemma (III-2.6), 
1 ry n 
Wf) = gap | Fex,(-bdt, vf € A(R”). 
Since pe € L', we can apply the dominated convergence theorem to obtain 
‘ 1 ~ 
limp) = ve f FlOw(tae 


Since A(R”) is dense in Cy(R”) (cf. III-2.5), II-6.8(iii) can be applied to 
show that there exists v € M{(R") such that the jz, converge weakly to 
v; that is, 


tim f udjy = [uty Vu € Co(R"). 
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- Aom—nat = f Fowl 


Since (A(R"))* = A(R") is dense in Co(R"), it follows that p(t) = w(t). 


Furthermore, 


(iv) Narrow convergence of the [p. 


Only the pointwise convergence of yx,(t) was used to prove (iii). We 
must now exploit uniform convergence. Let 


eee lel? 
Cal) = Gaye ep(- 2X ). 


Then ‘ 
(Ga)*(t) = exo(~Fe8?).. 


Consider the following integral of the nonnegative function (1— G ») with 
respect to the positive measure dtp: 


ee / (1 — G(x) dup (2). 


Writing Parseval's relation and taking into account that (G)(t) 
= G)(-t) = G)(t) and that f,(0) = 1, we obtain 


Ip = 1 = fin (t)Ga(tat 


Since f Gy = 1, this can be written 


Ip = f= fiplt))Ga(that = / at / aes 


where 77 is determined by first fixing q such that |fZ,(t) — fg(t)| <€ifp > q 
and ||t\| < 1, then choosing 7 < 1 such that |fig(t) — jig(0)| < € if ||¢l] < 7. 
Then 


(v) lMp(t) — 1] < 8e if [ltl] <n, 


whence 
0< Ih p < 3¢€ G42 f G). 
R” tll > 


The first integral equals 1; the second tends to zero as » tends to zero, for 
fixed n. Hence there exists A» such that |Z), »| < 4e for every p > q. Let 


h € C)(R") and set u = AG),; then u € Co(R”) and, by (iii), 


if ud[, > / udv. 
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Since 1 — G), is nonnegative, 


| [eu Wduy 


Moreover, /(t) = w(t), the limit of the jip(t), satisfies (v). Similarly, 
| : (u — v)dv 


tim fh - [haw Vh € Cy(R”). 


In particular, taking h = 1 shows that v(R”) = 1; that is, v is a prob- 
ability measure and the X, converge in distribution to the distribution v. 
This proves (i). 


< [hllo.llt — Gaollez = lll, / (1— Gy) dtp < 4ellhllcy- 


< 4ellh|lc,, 


and finally 


PROOF OF (i) => (ii). By the definition of narrow convergence, 


1 e "dyp(x) > i e’*dv(z) 


for every fixed t. We must now prove uniform convergence in t. By II-6.8((ii) > 
(iv)), given « > 0 there exists M such that p»([-M, M]°) < « for p sufficiently 
large. Then 


+M 
px,(t) = i e*dip(x) + Oe, where |6| <e. 
-M 


Differentiating with respect to t under the integral sign shows that the first partial 
derivatives of yx, are bounded by M. Hence the yx,(t) are equicontinuous 
functions, and the result follows by Ascoli’s theorem that pointwise convergence 
on a compact set implies uniform convergence.” O 


4.1.3 Differentiability of characteristic functions 
Proposition. Let X be ar.v. with values in R”. Suppose that 
E(||X|[Rn) < co, where p>1. 


Then px is r times continuously differentiable in t for r < p and 


(i) (4) oe (2) ox) (t) = E[(iX1)™ ... GX") et]. 


PRooF. Using the criterion for differentiation under the integral sign (I-7), 
we have 


a 
5p, Elexpli(taX" +...4+t,X")]) = EX! exp[i(t: X*+...+tpX")]). 
A 


?See Bourbaki, General Topology, X.2.4. 
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The result follows by noting that 


XI] © LP = (OYA) (XY EL) if m treat... ttn <p. 


4.1.4 Taylor series expansion of a characteristic function at the origin 


Proposition. Let X be ar.v. with values in R” and suppose that ||X||Rx € 
L?. Then 


(i) px(t) =1+iB(X).t ~ 5ax(t) + o(liP), 


where qx(t) = dl akJt,t; and ak = E(X*X!). 
The matriz a’) is symmetric and nonnegative; that is, 


(zt) qx(t)>0O for every tER”. 


PROOF. Since ||X||R» € L?, 4.1.3 implies that yx is twice continuously 
differentiable. The derivatives at the origin can be computed using 4.1.3(i), 
and (i) follows by using Taylor’s formula with remainder. 

Moreover, 


2 
gx(t) = ba tytjE(X*X/) =E (x4) >0. 


kj 


4.1.5 Definitions. X is said to be centered if E(X) = 0. 

If X is not centered, a centered r.v. is obtained by setting Y = X — 
E(X)1g. The quadratic form qy(t) associated with the centered variable 
is called the covariance of X and written ox (t). 


4.2 Characteristic function of a sum of independent r.v. 


4.2.1 Proposition. Let X;,...,X, be mutually independent R”-valued 
r.v. on the probability space (Q,.A,P). Let 


(2) px, (t) = E(e**) 


be their characteristic functions, and set 


(ii) Se Rae 4 Ks 

Then 

Ca) vs(t) = [[ ox.) 
k=1 


PROOF. ys(t) = E(e!-Xtett-X2 || eit Xr), 
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Using 3.2.2 with f, = e**, we obtain 
ps(t) = T[ Be) = [] ox, (#).0 
k k 


4.2.2 Corollary. With the notation of 4.2.1, let 11,..., bp be the distri- 
butions of X1,...,Xp and let v be the distribution of S. Then 


VY = [1 * M2 **+* Mp. 


PROOF. Using 4.1.1(iii), we may write 4.2.1(iii) in the form 


a 


D = jix(t)fa(t) ...fip(t). 


By III-1.4.4, the convolution product of measures corresponds to the prod- 
uct of the Fourier transforms. 0 


4.2.3 Proposition. Let X1,..., Xp be independent R” -valued r.v. Suppose 
that ||Xx||lpn € L?,1<k <p, and let S = X,+---X,. Then the covariance 
forms are related by 


ogs(t) = So ox, (t). 
k=1 


PROOF. Setting Xp, = Xp - E(X;)1q, we can reduce the proof to the case 
where the X, are centered; then S is centered. We must verify the identity 


qs(t) = Dhar 9x, (t), oF 


2 2 


E[ [| >0tS?] | =E] (Sot. So xP] | =So ete [SO xXPX!, 
dD. k j kl 


jim 


But, for 7 # m, Xx} and X! are independent r.v. by 3.1(iv). Hence, by 
3.2.1, 


E(X}X},) = E(X*)E(X/,) =0. 


Thus 


qs(t) = So tet S > E(X}X}) a S— ax, (t).0 
kl j j 
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4.8  Laplace’s theorem and Gaussian distributions 


4.3.1 Laplace’s theorem. Let X;,X2,...,Xp,... be a sequence of inde- 
pendent R”-valued r.v. defined on the probability space (Q, A, P). 

Suppose that the X, all have the same distribution, that ||Xi||R» € L?, 
and that E(X,) = 0. Set 


1 
Gn = FelXi to + Xn): 


Then the sequence of r.v. Gy, converges in distribution to a r.v. G with 
characteristic function 


v= (ol $4) -#(e0( 48) =r (5) 


By 4.2.1, ys, (t) = (yx, (t))”. Hence 


ya(t) = lex, (=)] . 


0G, (t) = exp (nog (ox, (=z) )) 


(Note that yx,(0) = 1. By continuity, there exists « > 0 such that, for 
lt] < €, |px,(t)| A O and —F < argyx,(t) < 5. Thus log yx, (¢) is well 
defined for |t| < €.) Furthermore, 


PX (=z) ae ag 5a (t +o (=) 


uniformly in t, when t ranges over a compact subset of R”. Hence 


log px, (=z) = ~ sax, (t) +0 () , 


and yq,,(t) > exp(—$4@x, (¢)) uniformly on compact sets. 4.1.2 implies the 
result. O 


or 


4.3.2 Gaussian distributions 


With the next few results, we make Laplace’s theorem more explicit by 
computing the distribution of G. 


4 Characteristic Functions and Theorems on Convergence in Distribution 205 


2 


(i) Lemma. [ex (-5) dt = (2n)*/?. 


PROOF. We prove this well-known result by using a trick from real variables. 
Let J = 1 exp(-5) dt. Then, by Fubini’s theorem, 


2 2 
Pe fe eo(—5} eo(—5) dudv. 


Passing to polar coordinates, let u = rsin@ and v = rcos@, with r > 0 and 
0<@< 2n. 
This change of coordinates defines a diffeomorphism of R? with Jacobian ma- 


trix 
Pe sin@? —cos@ 
~ \ reos@ —rsing }° 


Since |det J| =r dr dé, 


Qr +00 2 +00 r2 
raf | e(— 5) ra ao=2n | exp(—5) rar 
6 5 2 ‘ 2 


The last integral can be computed by setting r? = w. Thus 


+00 r2 +00 
i exp| —— | rdr= i exp(—w)dw = 1.0 
0 2 0 


t? x 
r, i _ 
(ii) Lemma. aml exp (ite — 5) dt = exp(-F). 


PROOF. Let 7 be an auxiliary parameter defined by p,(t) = CEE. exp(-£). 
It is straightforward to verify that 


Op, _10°p, 
@) dr 2 OF 


Note that z+> p,;(x) is an element of the space S(R). By IJII-4.2, differentiation 
with respect to x is mapped to multiplication by —it of the Fourier transform: 


pr(t) = | p,(x) exp(itr)dz. 
R 
By I-7.8.4, we can differentiate under the integral sign; thus (i) can be written 
. Oe tc 
—},(t) = -~p,(t). 
(2) 3, Pr ) 2 pr(t) 
Note that, as t — 0, p,(t)dt converges narrowly to the Dirac measure at zero. 


Hence p;(t) — 1 for each fixed t as rT — 0. The differential equation (ii) thus 
gives p;(t) = exp(—$7); the lemma follows by setting 7 = 1. 0 
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(iii) Lemma. Let Q(t) = (t1)? +--+ (tn)?. Then 


ons i exp [-52t + ita dx = exp| 5200] 


PROOF. Since exp{—4 (x)| =Thes exp[—$(x*)?], the conclusion follows 
from (ii) and Fubini’s theorem. G 


4.3.3 Definition. A Gaussian distribution is a measure on R” with 
Fourier transform j of the form 


(i) jit) = exp (-3H00) | 


where h(t) is a positive quadratic form. 


4.3.4 Proposition. Let j: be a Gaussian distribution given by 4.3.3(i). 
Suppose that h is positive definite. Then 


1 
dp = cexp [Satay dx, 
where c is a normalizing constant such that [du = 1 and hi(x) is the 
adjoint of h, defined by 
(it) hi(x) = sup{t.a : h(t) < 1}. 


PRooF. Let a basis be chosen such that A(t) = S> t?; then 4.3.2(iii) implies 
(i) with hi(z) = So zz. Using formula (ii), hi(x) can be defined without 
changing bases. 0 
4.3.5 Proposition. Let 1. be a Gaussian distribution of the form 4.3.3(i). 
Let 


V={t:h(t)=0} and V+={a2:ta2=0 Vte V}. 
Then p is a measure with support V+. Let y € V+ and let dy be the volume 
measure on V+. Then 


1 
du =c exp [-3f w) dy, 


where h, is the quadratic form defined for ye V+ by 
hi(y) = sup{t.y : A(t) < 1}. 
REMARK. The quadratic form h, is positive definite. 
PrRooF. Let xz, t € R” be decomposed as 
r=yt+2, where yeEeVv-,zeV; 
t=n+¢, where 7EV~,CEYV. 


fa hes 1 
peter an = of e'Y exp [-3hn (| dy. 
vt 2 


Then 


oO 
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5 Theorems on Convergence of Martingales 


5.1 Martingales 
5.1.1 Definition of a filtration 


Let (Q,.A, P) be a probability space. A filtration of the space is an increasing 
sequence {A,} of sub-c-algebras of A such that 


Ap CALC... CAn C.... 


Let A, be the a-algebra generated by all the A,; we write A,, = lim An. 
The filtering sequence is said to converge to A if Ay = A. 

The phenomenological meaning of an increasing sequence of o-algebras is 
clear. Let 0,1,2,..., be the various instants of an “experiment” . 

Let A’, be the Boolean algebra generated by all the observations made 
up to time n (in the sense of 1.1). Then Aj, encapsulates all the experi- 
menter’s knowledge of the system at time n. The o-algebra generated by 
A’, is written A,, and might be called the o-algebra of the past at time n. 


5.1.2 Sequence of r.v. adapted to a filtration 


Let (Q,.A, P) be a probability space equipped with a filtration A,. A se- 
quence of r.v. {X,,} in L°(Q,.A) is said to be adapted to the filtration if 
SEO Ag) 


5.1.3 Given a sequence {Yj} in L°(Q, A), let Af be the o-algebra generated 
by Y=-1(Br), where s < k. 

Then the Ay form a filtration of (0,A,P). Moreover, the sequence of 
r.v. Yn, is adapted to the filtration A, if and only if An D AY for any n. 


5.1.4 REMARK. A, might be called the o-algebra of the past corresponding 
to the “experiment” that consists of observing the values of Yi(w),..., Yn (w). 


5.1.5 Definition of a martingale 


Definition. Let (Q,.A, P) be a probability space equipped with a filtration 
{A,}. A sequence {X,,} of r.v. is called a martingale if 


(i) the X, are integrable: X, € L1(Q, A); 
(ii) the sequence {X,} is adapted to the filtration {Ay}; and 
(iii) EA"(Xy41) = Xn, n 2h 


5.1.6 Proposition. [f {X,,} is a martingale, then 
E“*(Xnip) =Xn. Wn and Vp > 0. 


PROOF. Since An C An+i C Anta C.-. C Antp-1, it follows from 2.3(iii) that 


BA» = pan pA | BAnte 1, 
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By 5.1.5(iii), BA+e-1(Xn45) = Xntp-1, EA te-2(Xnape1) = Xnap-aye--s 
and finally 


pAn (Xn4i) = Xn. 


5.2 Energy equality 


5.2.1 Proposition. Let {X;} be a martingale relative to the filtration 
{Ax}, and assume that X;, € L7(Q,A) (1 <k <n). Then, for n> p, 


n-1 


E(X7) BCX) = 2B Xj41 — X5)). 


PROOF. Set ej = Xj41 — X;. Then, for m < j, 
(i) B46 (e;) = EA BA(X,,, - X;) = EA" (x; - Xj) =0. 
Writing Xn = Xp, + oj |e; and expanding X2, we obtain 


n-1 n-1 
E(X;) = E(Xp) + 9 B(e7) + S> B(ejey) +2 5) E(Xpe;). 
J=P j' J=pP 
JAS 

We now show that all the terms appearing in the last two sums are 
zero. Assume that j < j’. By 2.1.3, E(eje;) = E(E“+ (e;¢;")). We now 
use the fact that EAs+ (6;¢;1) = ejE’™*'(e;-). Since 7 < 7’, (i) implies 
that EA+ (6,7) = 0, whence EAs+1 (6 ;¢;) = 0. Similarly, E(Xpe;) = 
EE“: (X,¢)) = E(X,)E“(e;)) =0. 


5.2.2 Corollary. Let {X,,} be a martingale. Then E(X2) is an increasing 
sequence. 


PROOF. Apply the energy equality with p= n-— 1. 


5.3 Theory of L? martingales 
5.3.1 Definition. {X;,} is called an L? martingale if 


sup E(X7) < +00. 


It follows from 5.2.2 that 


5.3.2 lim E(X?) exists and is finite. 


5 Theorems on Convergence of Martingales 209 


5.3.3 Structure theorem. Let {X;} be an L? martingale. Then there 
exists Xo. € L?(Q,A0) such that 


(2) Xx > Xoo|lz2 —0 and 


(ii) X, = E“*(Xx,,). 


Xoo is called the final value of the martingale. 
Conversely, let ...Gn C Gn41 C ... be an increasing sequence of 
a-algebras on (Q,.A), let Goo be the o-algebra generated by all the Gy, let 


f € L?(Q,G.0,P), and let Y, = E9*(f). Then 


(iit) Y, is an L? martingale 
and 
(iv) Yn — fllz2 — 0. 


PROOF. We first prove the following lemma. 


5.3.4 Lemma. E((Xn+p — Xn)”) = E(X? 


re) — E(X2). 


PROOF. 
E((Xntp — Xn)?) = E(X2,,) + E(X2) - 2E(Xn4pXn) and 
E(Xn4pXn) = EEA" (Xn4pXn) = E(XnE“*(Xntp)) = E(X3).0 


PROOF OF THE THEOREM. Since the sequence a, = E(X2) is convergent 
by hypothesis, Ve > 0 dno Vp > 0 Qn4p — Qn < €. By 5.3.4, 


|Xntp — Xnl|lz2 <€, Vn > no and Vp > 0. 


Thus X;, is a Cauchy sequence, which converges since L? is complete. More- 
over, X_ = EA (Xx4r) for all r > 0 by 5.1.6. 


Let r > +oo. Then Xia, > Xoo in L?; hence E“*(X;,4,) + E“*(X,.), 
and (ii) follows. 
We now prove the converse. By 2.3.3(iii), 


EO BG n+% = BIs, 
Applying this to f, we obtain 
E9> (B+ (f)) = EM (f), 


or 
E9> (Yanan) = Yo- 
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Furthermore, since the projection operator Eg» has norm < 1, 


Yale = NES (A)ilce < lle. 


The sequence {Y,,} is an L? martingale. By the first part of the theorem, 
Afs. € L? such that 


Yn — foollz2 — 0. 
By the lemma, f = f,, and this completes the proof of the theorem. 0 


5.3.5 Lemma. Let (2,.A,P) be a probability space filtered by {An}. Set 
Ax = lim A, (in the sense of 5.1.1). Then 


Bf - BAS fle 0 for every fel? 


PROOF. We write V, for L?(Q,An), a closed subspace of L?(Q,A). Let Vac = 
UnVn and let Ty, and ly, denote the respective orthogonal projections. Then 
Hilbert space theory shows that 


Iv, f — Tv. fllz2 = 0. 


Moreover, L?(Q,Asx) D Vn for all n. Since V3. is the smallest closed subspace of 
L? that contains all the Vp, L?(Q, Aso) D Veo. 

Now let B € Ax. If we show that 1g € Vx, the density of the simple functions 
in L?(Q, Asc) will imply that L?(Q,A.) C Voc and hence that 


(i) L?(Q, Acc) = Voc. 


To prove this, let B denote the set of subsets B of Q such that B € A, for some 
s. From the set-theoretic point of view in P(Q), B = U,As. 

Then B is a Boolean algebra and B C Ax. By I-1.4, Ax is the monotone class 
generated by B. 

If B € B, then 1g € Vx. Let M denote the class of subsets D of 2 with 
indicator function satisfying 1p € Voo. 

We now show that M is a monotone class. Let D, be an increasing sequence 
of elements of M, with limit D,.. Then 1p, — 1p... everywhere, and by the 
dominated convergence theorem ||1p,, — 1p... ||;2 — 0. 

Since P(Q) = 1, the analogous result for decreasing sequences follows by taking 
complements. Thus M is a monotone class and M = Aso, so (i) is true and the 
lemma is proved. 


5.4 Stopping times and the maximal inequality 
5.4.1 Definition of stopping time 


Let {X;,} be a martingale defined on the space (Q, A, P) filtered by {.A,}. 
A stopping time T(w) is a function on Q, with strictly positive integer 
values, such that 


(2) Ary ={w:T(w)>p}eAp VpeEN. 
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(ii) Proposition. If T! and T? are two stopping times, then 
T?(w) = inf(T" (w), T?(w)) =per (T* AT?)(w) 
is a stopping time. 


PROOF. ATs p = Afi p N AT? ,p € Ap. 


(iii) Any given time g can be thought of as a stopping time. 


5.4.2 Truncated martingales 


Definition. Given a martingale {X,,} and a stopping time T, the truncated 
martingale is defined by 


Xp (w) = X7(w)an(w)- 


We proceed to justify this terminology by showing that {X7} is a mar- 
tingale. Since 


m— 


(2) Xj41—- Xj)lap; + X11 
2. 


and all the functions on se right-hand side are A,,-measurable, x. is An- 


measurable. Moreover, E A, (X7",,) can be computed by observing that, on 
the right-hand side of (i), all the functions except X,+41 are A,-measurable. 
Thus 


BA» (x2, ,) = (BA" (Xai) — Xn)laz, + XE. 


But 
E* (Xn41) = Xn, 
whence 


E+ (Xt) = X70 


n 


5.4.3 Definition of the maximal function 


Let {Y,,} be a martingale, let 


Y; = sup lYp|, 
l<p<n 
and let 
Y*= lim Y;. 
n—-+00 


Y* is called the mazimal function of the martingale {Y,,}. 
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5.4.4 Doob’s maximal inequality 


Proposition. Let {Y,,} be a martingale on the space (Q,A, P} filtered by 
{A,,}. Then, for every constant y > 0, 


(i) P(YE >) < = (B((¥al) +E(\Yi)] (n =0,1,...) 
and 
(ii) P(Y*>7)< = sup B(|¥a)): 


ProoF. Let At = {w : suppcn Yp(w) = y} and let 


T(w) inf{p: Yp(w) >y} if we Ay 
Tw) = n if w¢ At. 


Then T(w) <n. Moreover, 
{w:T(w) > q} =Up<q{w: Yow) <y} eA, if q<n 


and 
{w:T(w) >n}=90. 
Thus T is a stopping time; let {Y,"} be the martingale truncated by T. 
Then E(Y,7) = E(¥1) (since YJ = Y,) and 
B(Y,7) = E(¥2 lren) + E(¥7 Iran). 


Y,? > y on the event {w : T(w) < n} and Y,7 =Y, on {w: T(w) = n}; 
hence 
E(¥. Ir<n) > YE(Ir<n) = yP(A%). 


Thus E(¥1) = E(¥,17r=n) = AP (AN), and 
¥P(AM) < E(|¥al) + EY). 


(i) follows by observing that 
{w: ¥i(w) > 7} = AZU {2 ; sup[-¥p] > v} ; 
p<n 


To prove (ii), it suffices to note that {Y,*} is an increasing sequence with 
limit Y*. Hence Y* > y = Ve > 0 Sn such that P(Y, > y —). Thus, by 


(i), 
PUY? oy) s 


sup E(|¥;|)-0 


OY SLE, 
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5.5 Convergence of regular martingales 

5.5.1 Definition of regular martingales 

Let {Y,} be a martingale relative to the filtration A, on (Q,A,P). {Yn} 
is called regular if there exists Z € L1(Q,.A, P) such that 


Yn, =E“*(Z) vn. 


EXAMPLE. Every L? martingale is regular by 5.3.3. 


5.5.2 Final value of a regular martingale 


Let A,, denote the o-algebra generated by the union of the o-algebras A, 
and set 
Yo = EA~ (2). 


Y5, is called the final value of the martingale {Y,,}. 


5.5.3 Theorem on L! convergence. Let {Y,} be a regular martingale 
and let Y,, be its final value. Then 


(i) Y, =E“*(y,,) and 
(it) E(l¥n — Yoo!) ~0 as n—- +00. 


Proor. Let yy(t) be the function introduced in I-6.7 to define the trun- 
cation operator and set Z”@ = yyy(Z). Then 


(iii) |Z" — Zp +0 as MO. 
Set Yn.ar = B4"(Z™). Then 
Yn = Yaz < |Z _ Z\||b- 


Thus ||Yn.az|lze < M, and hence {Y,,.7} is a martingale. Using 5.3.3 and 
5.3.5 and setting Yu = E4~(z™), we obtain 


(iv) Yn.m = E“*(Yoo,m) and ||Ya,m — Yoo,atllz2 > 0- 


(i) is proved by using (iii) and the first formula of (iv), then letting M tend 
to infinity. Similarly, since the L? convergence in (iv) implies L! convergence 
by the Cauchy-Schwarz inequality, (ii) follows for Y,,.7. Letting M — oo 
shows that (ii) holds for Y,. 0 


5.5.4 Proposition (Almost sure convergence). Let {Y,,} be a regular 
martingale and let Y,, be its final value. Then 


Yn(w) > Yoo(w) almost surely. 
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PrRooF. Let 6,(w) = sup,s, |[¥n(w) — ¥;(w)| and let G(w) = limg—. By(w). 
For fixed gq, let 
Zm = Lyot+m — Y, (m > 0). 


Then {Z,,} is a regular martingale relative to the filtration {Aji,} and 


sup ||Zml|z1 < |[Yoo — Yollnr + sup |[Yoo — Yq" lz. 
m. q'>4@ 


By 5.5.3(ii), the right-hand side is less than € if gq > qo. Hence, using the 
maximal inequality 5.4.4(ii), 


2 
P({w = Bylwe) >) < > if ¢> 40. 


Fixing y, let q — oo. Since {3,} is a decreasing sequence of functions, 
P({w: B(w) > y}) =0, whence S(w) =Oas. 


{Yn(w)} converges a.s. Let Zs. be its limit. Since {¥,,} converges in L! to Yoo, it 
has a subsequence {Yn, } that converges a.s. to Yoo; hence Zoo = Yoo. 


5.6 L' martingales 


5.6.1 Definition. A martingale {Y,,} is called an L! martingale if 


sup ||Ynllz1 < +00. 
n 


EXAMPLE. Every regular martingale is an L' martingale. 


5.6.2 Proposition. Let {Y,,} be an L! martingale. Let T; < To <... < 


Tj <... be an increasing sequence of stopping times such that, for every j, 
Tj(w) < +00 a.s. Let Yr,(w) = Y7,(.,)(w). 
Then is 


(Yr;,,(w) =) Yr, (w))? < +00 4.8. 
j=l 


PROOF. Set a = sup ||Yn||;1 and let Y* be the maximal function. Then, by 5.4.4, 
P(Y* > p) < 4ap~', whence 


(2) Y"(w) < +00 as. 


Fix p and let f be the continuously differentiable convex function defined by 
f(t) = #? if |t] < p and f(t) = 2plt| — p? if |t| > p. Let g be the nonnegative 
function defined by 

g(v1,v2) = f(v2) — f(vr) — (v2 — v1) f’ (1). 


Then g(v1, v2) = (v2 — v1)? if |u| < p (i = 1,2). 


(iz) E[f(Y¥n)] < 2pB(|¥n|) < 2pa 
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(wii) E((Yjui -¥/)F'(%)] = BEY = BL) BY Wn - ¥3)] = 0; 
E(f(¥)-f(%)) = E Fue) 020 
go 
= SCE(9(¥j+1;¥5)): 
+00 - 
# (Soi -¥0-<0 <  2pa 


Hence, letting p — +00 and using (i), 


+00 
(iv) D1 &) - ¥i(w))? < +00 as. 


j=l 


We now generalize this “local version” of 5.2.1 to an increasing sequence of 
stopping times T,; < T2 <...< Tj <.... Set Yr, (w) = Yr, (w)(w). We would like 
to show that 

+00 
(v) S-(¥ry41(w) — Yr, (w))? < +00 as. 


j=l 


Once (ii) and (iii) have been generalized, the same calculation will give (v). 
Letting Ar, = {w : Tj > q}, we have, as in 5.4.2, 


f(¥ry)—F0%4) = SOF a1) - Fs) Lary, , 
q=1 
+00 
E(f(¥r,)—f%)) = SlE(G(Yo+1, Ya)tar,,) SD 9(¥at1, Ya) < 2pa, 
q=l q 
whence 
(«i)’ E(f(¥z,)) < 4pa. 


Let Ar, denote the o-algebra generated by the Agn Te (q), where gq € N. Then 


(iii)! E45 (Yp,_, — Yr,)f/(¥r,)) = 0. 


This proves (v). 0 


5.6.3 Fatou’s theorem. Let Y,, be an L! martingale. Then limp... Yn(w) 
exists a.s. 


PROOF. For a proof by contradiction, assume that Fatou’s theorem fails; then 
there exists b > 0 such that 


(vt) G= {e : limsup |Yn(w) — Yn’ (w)| > 2} satisfies P(G) > 0. 


n,n! —00 
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Let T;(w) = 1 and let the later stopping times be defined recursively by 
Tj+1(w) = inf{q : ¢ > Tj(w) and |Yr, (w) — Yq(w)| > d}. 


Since the sequence T}(w) is increasing, |¥r,,, (w) — Yr; (w)| > b. This contradicts 
5.6.2. 


IMPORTANT REMARK. Nonzero L! martingales can be constructed with 
lim Y,(w) = 0 a.s. It is thus impossible to reconstruct the martingale from 
this limit, as was done for regular martingales. Hence the importance of 
the regularity criterion that will be given in Section 5.8. In Section 5.7, we 
will develop a concept that is both interesting for its own sake and crucial 
for stating the regularity criterion. 


5.7 Uniformly integrable sets 


5.7.1 Definition. A subset H of L' is called uniformly integrable if for 
every € > 0 there exists 7 > 0 such that E(|h| 14) < e for all h € H and 
for every A € A with P(A) < 7. 


5.7.2 Proposition. Let H be a subset of L'. Then the following two state- 
ments are equivalent: 


(i) H is uniformly integrable. 


(it) lim sup | |h| dP| =0. 
9° |heH J\hAl>q 


PROOF. To prove that (i) = (ii), we first show that (i) implies 
(iit) 4M <+0o such that |All, <M VAEH. 


Let 7 > 0 be the number associated with « = 1 by Definition 5.7.1. Then (iii) 
follows from setting M = a +1. 
By Chebyshev, P(|h| > q) < q7'M. Since this expression tends to zero as 
q — oo, (i) implies (ii) formally. 
We now prove that (ii) > (i). For a proof by contradiction, suppose that there 
exist €9 > 0 and sequences {hn} in H and {A,} in A such that 
E(hnla,,) > and P(A,) 0. 


Let qo be chosen so that 


/ hdP< S Vh € H. 
|hl>¢0 


Set Bn = {w: |hn(w)| > qo}. Then 


€o < E(hn1a, (1B, + 1g0)) < E(hn1a,,) + goP(An). 
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Since the first term on the right-hand side is less than < and the second tends 
to zero as n — oo, this gives a contradiction. 0 


5.7.3 Proposition. Let H be a uniformly integrable subset of L' and let 
Hy, be the closure of H in the topology of almost sure convergence. Then 
Ay 1s uniformly integrable. 


PROOF. Set 
y(e) =supE(|h| 14), where he H and P(A)<e. 


Let {h,,} be a sequence of elements of H which converges almost surely to 
ho. Fatou’s lemma implies that 


y(e) 2 liminf E(|h| 14) 2 E(|hol 14), 
whence 
sup E(|h| 14) < y(e), Whi € Hi, VA such that P(A) <.0 


5.7.4 Theorem (Generalization of Lebesgue’s dominated conver- 
gence theorem). Let {u,} be a sequence of integrable functions on a 
measure space (X,A, pu), u(X) < +00, such that 


(i) the family {u,,} is uniformly integrable and 
(ii) Un converges a.s. to uo. 
Then 
\lun — Uollzs > 9. 
PRooF. By Egoroff’s theorem, there exist « > 0 and B € A such that 


p(B°) < € and uy, converges uniformly to up on B. 
Then 


l|’n — uollz: < E(|un — uo] 1B) + E(\un| Lee) + E((|uol] 1c). 


The first term on the right-hand side tends to zero by uniform convergence, 
the second by uniform integrability, and the third by the same reasoning 
as in 5.7.3. 0 


5.8 Regularity criterion 


5.8.1 Theorem. Let {X,,} be an L! martingale. Then the following con- 
ditions are equivalent: 


(t) {Xn} is regular. 
(it) {X, 21 <n < oc} is uniformly integrable. 
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PROOF THAT (ii) > (i). We know by Fatou’s theorem (5.6.2) that X,(w) 


converges almost surely to Z. By 5.7.4, this implies that ||X, — Z||,1 — 0. 
Hence, using the identity 


fixing n, and letting q go to infinity, 


X, =E“(Z).0 


PROOF THAT (i) => (ii). For ac to be fixed later, set Bn = {w: |Xn(w)| > ch. 
Then, since Bn € An and |Xn| < EA: (|Z), 


E(|Xn] 12,.) < E((1e, B4" (|Z|)) = B(E“" (jZ))12,,)) = E(|Z] 12,.). 


Hence, with 6 also to be fixed later, 


i |>ec 


Xa dP aoa, \2| aP 
|Xn|>c 


= / \Z| a+ [ |Z| dP 
(IXnl>e}{|Z1>0} {IXn|>e}N{iZ|<b} 


|Z| dP + bP(|Xn| >). 


{Z|>b 


1A 


But, by Chebyshev’s inequality, P(|Xn| > c) < 4E(|Xn|) < +E(|Z|), whence 


/ |Xn| dP < ‘| |Z| dP + eRZ). 
|[Xn|>e |Z|>b C 


Let b = q'/? and c = q; then the right-hand side tends to zero as gq — oo, and 
the conclusion follows by 5.7.2. 


6 Theory of Differentiation 


If f is a continuous function defined on [0,1] C R and F(x) = fj f, then 
F is differentiable for every x and F’(x) = f(x). The same result holds for 
f € L, provided that “for every x” is replaced by “almost everywhere” ; 
this is another theorem of Lebesgue. 

The derivative is computed as the limit of quotients of the form 


(i) HlF(a +6) ~ Fle] = qe) 


where v is Lebesgue measure, p(A) = fi f, and A, = [z,x+4 é. 
In this section, we study the limits of quotients of the form (i) on an ab- 
stract measure space. A.s. convergence will be obtained for an appropriate 
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choice of the A,: the A, will be the atoms of an increasing sequence A, of 
finite sub-o-algebras of A, “converging to A”. 

Quotients of the form (i), which thus form a martingale for the filtration 
An, will be used to prove the Radon-Nikodym theorem. 

Conditional probabilities can immediately be defined for conditionings by 
finite o-algebras; the existence of conditional distributions in the general 
case will depend essentially on a convergence theorem for vector-valued 
martingales. The convergence of such martingales will be clear for Radon 
measures. A structure theorem will allow all separable measure spaces to 
be realized by means of Lebesgue measure on R. 


6.0 Separability 


The measure space (X,.A,,) is called separable if there exists a sequence 
that is dense in Li: in other words, if Lh, satisfies the first separability 
axiom I-2.4.1(i). 


Consider the case of Radon measures on a compact space Y. If Y is metrizable, 
then C(Y) satisfies the first separability axiom and, since C(Y) is dense in Li, 
the same holds for re The same result is true if Y is locally compact, metrizable, 
and the countable union of compact sets. 


6.1 Separability and approximation by finite a-algebras 


Proposition. A measure space (Q,A,P) is separable if and only if there 
exists an increasing sequence of o-algebras Ay C Ag C ... C An... such 
that each o-algebra A, is finite and 


EA» (fF) —f forevery fe L'(Q,A,P). 


The sequence of a-algebras A, is said to P-generate A. 


Proor. Assume that (9,4, P) is separable, and let fi,...,fn,... be a 
dense sequence in L!. Approximating each f, by a sequence of simple 
functions gives a countable family ® of simple functions which is dense in 
hee 

Let g1,..-,9n,--- be an enumeration of this sequence and let A; be the 
o-algebra generated by g, '(A), 1 <k <i. 

With each f € L’ we now associate a sequence {X;,} defined by 


EX“: f = X;,. 


Then {X;} is a regular martingale, which converges in L! by 5.5.3. Let 
Xoo = lim Xx. Since ||Xol|z1 < || f||z1, a bounded operator 7: L! — L! 
can be defined by setting m(f) = X., and 

EA x. = lim EA*EA f= BA = X,. 


q7 oe 
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That is, 7? = 7. The image V of 7 is closed, for if un = (fn) and un — uo, 
then since tu, = Uy, and 7 is continuous it follows that mug = uo. V is 
thus a closed vector subspace of L1(X, A, 1). 

Let fo be a simple function in ®. Then fo € L°(A,) for k sufficiently 


large, and hence EA (fo) = fo. Thus fo € V. Since the family ® of simple 
functions is dense in L!, it follows that V = L!. 


The proof in the other direction is clear. For each k, EA (ZL?) is a finite- 
dimensional subspace of L' and hence separable. The union of these spaces 
is separable and dense in L!. 0 


6.2. The Radon-Nikodym theorem 


6.2.1 Theorem. Let (Q,.A,) be a separable measure space and let 4 and 
py be finite measures defined on A. Then the following statements are equiv- 
alent: 


(i) For every AE A, p(A) =0 => (A) = 0. 
(ii) There exists k € L', k > 0, such that v(A) = f, kdu. 


REMARK. The function k is called the density of v with respect to 4 and 


is sometimes written k = aa 


PROOF. It is trivial that (ii) = (i). Indeeed, if k1,4 is a function that is 
zero a.e., then its integral is zero. To prove that (i) => (ii), assume that 


(iii) U(X) < +00. 


This hypothesis can easily be dropped later, by taking an exhaustion se- 
quence {A,,} for X. 
Multiplying by a constant reduces the proof to the case where 


(iv) w(X) = 1 and (X, A, 4) will be considered as a probability space. 


We now prove that hypothesis (i) implies the following quantitative ver- 
sion. 


6.2.2 Lemma. Assume that 6.2.1(i) holds. Then, for every € > 0, there 
exists 6 > 0 such that 


(A) <6 implies v(A) <e. 
PROOF. Otherwise there would exist €9 and A, such that 
p(Ap) <27* and v(Ax) > €0. 


Set Gn = UpsnAg; then u(Gn) < 2~"*!. Since G,, is a decreasing se- 
quence, 


(i) pt (AnGn) = 0. 
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Furthermore, since v(G,) > v(An) > €0, 
(a) Vv (OnGnr) = limv(G,) > €. 


But (i) and (ii) contradict 6.2.1(i). 0 


6.2.3 Associated martingales 


Let Aj,...,An,... be the increasing sequence of finite o-algebras construct- 
ed in 6.1. 

Let €, = {e1,..., es} be the atoms of A,, and let a function Y, € L°(Ap) 
be defined by setting 


v(er) 
Y,(er if e, € €, and p(e, 0, 
of ) uler) p pl )# 
Y,(er) = 0 otherwise. 


Then E(Y,) = 3.’ ¥p(er)u(ex), where the sum 5~’ is restricted to those 
atoms such that p(e,) 4 0. Since p(e,-) = 0 = v(e,) = 0, it follows that 
E(¥,) = Cv(er) = (X). 

More generally, let Api be the o-algebra following A,. An atom e, of 
A, can be decomposed into atoms of Apii: er = Gri U Gr.2 UU... U Grs- 

Since the function Y,,1 is constant on each atom g, 


EA» (¥,44)(er) = 5 Do YornlGrsdalons) 


But Yp41(9r,j)M(9r,j) = V(Gr,,j) by the definition of Y,41. Since $9 v(gr,;) = 
v(er), 


BA» (¥,41) =Y¥p, 


and we have proved the following result: 
The Y, form a martingale. 


6.2.4 Lemma. The martingale {Y,} constructed in 6.2.3 satisfies the uni- 
form integrability condition. 


PROOF. Let p be fixed. Given € > 0, we must show that there exists 7 such 
that 


(i) i Ypdu = E(Y, 14) <e for any A € A such that p(A) < 7. 
A 
By 2.6.1(iii), 


E(¥p 1a) = E(E“(y, 14)) = E(Y,E“? (14). 
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Set y = BE“: (1,4): then 0 < y < 1 and E(y) = p(A). 
Introducing the atoms e, of A,, we have 2.4.1. Since vy is A,-measurable, 
y is constant on each atom e; of A,; thus 


E(¥,y) = >- vler)v(er) and E(y) = }~ v(er)uler). 
Define a partition ®, of E, by e, € ®, if y(e,) € [2~-°- 4,275]. Then 


E(¥,y) < ye; where H, = U Crs 
Gree, 
and 
u(A) = E(y) > 50 27"! (A,). 


Let sq be chosen so that 2-s0t1y(X) < 5. Then 


€ 

E(Y, oe ; =. 

(Ye) < SO (Hs) + 5 
O<s<sy 


Let 7’ be the number associated by Lemma 6.2.2 with «’ = 4, and let 


n = 2-%~1n!. Then, if (A) < 7, we have u(H,) < 7’ for 0 < s < sp. It 
follows from 6.2.2 that v(H,) < e’, and thus 
€ 


2 


=€. 


B(Ypy) < 5 als 


This proves (i). 0 


6.2.5 Proof of the Radon-Nikodym theorem 


Since {Y,} is an L' martingale and is uniformly integrable, there exists a 


function k € Li such that Y, = E(x), 
We begin by showing that 


v(A) = E(k 14). 
By the construction of {Y>}, 
i yvdv =E(Y,v) if we L°(Ap). 
In particular, 


B(E“ KE“ 1,4) = i: (B4> 14)dv. 


Set yp = BE“? (14): then 0 < yp < 1. The martingale E44» (14) converges 


pi-a.e. to Ly by 5.5.4, and convergence si-a.e. implies convergence v-a.e. by 
6.2(i). Hence, by Lebesgue’s dominated convergence theorem, 


[etraaya + vA). 
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Since {Y,} is uniformly integrable, so is {Y,yp}. Moreover, Y,yp con- 
verges ji-a.e. to k14, and Theorem 5.7.4 implies that 


| %ovedn [3 dp. 


ii dp = v(A).0 


Thus 


6.3 Duality of the L? spaces 


Theorem. Let (X,A, 1) be a measure space, let 1 <p < +00, and let q be 
the conjugate exponent to p. Then the space of continuous linear functionals 
on L? can be identified with L?. As in I-9.4.3, the dual pairing is written 


a) = | fo du, where fEL?, ge’. 


Proor. Using an exhaustion sequence {A,,} of X, we can reduce the proof 
to the case where (Q,.A, 4) is a probability space. 

A positive linear functional | on L? is a linear functional such that I(f) > 
0 for every f > 0, f € L?. As in II-5, it can be shown that every linear 
functional on L? can be written as the difference of two positive linear 
functionals. It thus suffices to prove the theorem when / is positive. 

Since p(X) < +00, L® C L? and we can define 


v(A) = 1(1,4) 2 0. 


Let Ch = Ur<icn A; and let Cy, = UPL, Aj. Since the A; can be assumed to 
be disjoint, 37, <;<, 1a, = 1c,,. Thus 


v(Cr) = So 1(Ai) = 5 (Ai). 
le, — le. everywhere and 1c, < 1; hence, by Lebesgue’s dominated 
convergence theorem, |/lc, — 1¢.,||z2 — 0. It follows that v is a measure 
defined on A. Furthermore, (A) = 0 implies 14 = 0 in L/,, whence v(A) = 


l(14) = 0. Thus hypothesis 6.2.1(i) is satisfied, and the Radon-Nikodym 
theorem implies the existence of a nonnegative k € Lh, such that 


[* du = v(A). 


Using linear combinations of characteristic functions, we see that 


(i) ly) = | ky dy 
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for all simple functions. If we show that 
(ti) ke L, 
each side of (i) will define a continuous linear functional on L?; since they 
coincide on the dense set of simple functions, they will be equal everywhere. 


Let B, = {x : k(x) < n} and let k, = k1g,; then k, € L*. If g < +00, by 
Fatou-Beppo Levi the negation of (ii) is equivalent to the assertion that ||Kn||za 
oo as n> CO. 


Let a = p~'q and let 
1 


Wn lla 


mea 
ky * 


Un 


Then |lun||ze = 1 and 


[(un) = i = i = [lkn|lby ~ 00 asn— oo, 


contradicting the inequality 
(tn )| SMI) lun iice- 


The case q = o& is treated in the same way, using the inequality 


i1—1s,) > nl1—1., |r. 


6.4 Isomorphisms of separable probability spaces 
6.4.1 Atoms of a measure space 


Let (Q,.A, P) be a measure space. A € A is called a P-atom if P(A) > 0 and 
if, for any B € A such that lg < 1y ae., either lp =O ae. or lp =1,4 
a.e. This notion of atom corresponds to the one introduced in J-6.2, except 
that we now consider the classes defined by equality a.e. 


6.4.2 Structure theorem (nonatomic case). Let (0,A,P) be a sepa- 
rable probability space which is complete and has no P-atoms. Then there 
exists f € L™(Q,A) such that 0 < f < 1 and f is a probability space 
isomorphism from (Q,A,P) onto [0,1] equipped with Lebesgue measure. 


ProorF. Let {A,} be the increasing sequence of finite sub-c-algebras of A 
constructed in 6.1. Note that we could regroup the atoms of A,, that have 
measure zero with an atom of strictly positive measure, to produce a new 
sub-o-algebra A’, such that P(A) > 0 if A € Aj, and A # 0. Assume that 
this has been done. 

We next enumerate the atoms of A,, say €1,1,.-.,€1,s, Where s = s(1), 
and then the atoms of A», consistently with the enumeration for A;. That 
is, all the atoms into which e;.; is decomposed appear first, then the atoms 
into which e; 2 is decomposed, and so on. 
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With the atoms of A, listed as €g1,...,€g,s, 5 = S(q), there exists a 
strictly increasing integer-valued function y such that 


€q,5 is decomposed into the atoms €g+1,;, with y(s) <j < y(s +1). 


Having defined this coherent enumeration of the atoms of Ay, we set Bg 7 = 
P(éq,r) and define 


1 
f,(x) = 3 Pat + Sy Baa where & € €q1. 


r<l 


Then f, € L*(A,) and 


EB“ (fs41)(Cas) = So Botir 


r<y(s) 
1 1 
a » P(e€q41,5) | 5 Bath + y Pati 
P(€q,5) ; 2 
y(s)<j< (stl) m<j 


In the second sum, observe that 


2 


1 1 
3 > Botig| = 5 tP(ea.s)]- 


p(s)<j<p(st1) 


Similarly, the first sum can be written }7,., 84,4, whence 


E+ (f.41) = faq 


The f, form a martingale; since 0 < f, <1, they form an L? martingale. 
This martingale converges a.s. to its final value f € L™(A), and fy = 


EAs). 


Furthermore, let 
Nq = sup Bq,r. 


Then {n,} is a decreasing sequence. Assume for contradiction that 

(2) lim yg =€ > 0. 

Then there exists a decreasing sequence of atoms an € An such that 
P(lim | an) = lim P(an) =e > 0. 


Let C = lim | an. Since the probability space (,.A, P) has no P-atoms, we can 
find D € A such that DC C and P(D) > 0, P(C — D) > 0. 
Since the o-algebras A, P-generate A, 


(és) E“(1p) 1p as. 
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But, since 1p < 1c, EA» (1p) is constant on the atom a,; that is, 


(277) lim E* (lp) = nlc. where +, is a constant. 
This contradicts (ii). Hence (i) cannot hold, and 
(iv) lim 7 = 0. 


(uv) The distribution of f is Lebesgue measure. 
Since the functions f, converge a.s. to f, their distributions converge to the 
distribution of f. Let u € C((0,1]) and consider 


E(u(fn(w = ae 26 ru (fa. + > a0} . 


l<r 


The right-hand side is a Riemann sum for u, and since the mesh of the parti- 
tions tends to zero by (iv), the Riemann sums converge to fu dx, whence (v). 


(vi) Let A’ = f~'(BR), where BR is the Borel algebra of R. Then L&(A) = 
LBA’). 
Let 8,4 = ae 23y.;- Then, by the construction of the fj, 
(Gee) = €q.r if 9 > q. 
By the a.s. convergence of the f;, 
FUE Peal) @) Car 2 FOG Bi r+i))- 


Since P(f~'(3),-)) = Lebesgue measure of {@,,,} = 0, the two inverse images 
above differ by sets of probability zero. Hence L?(A’) > L® (A), and (vi) follows. 


6.4.3 Structure theorem (general case). Let (Q,A,P) be a separa- 
ble complete probability space. Then there exists a discrete measure y = 
So cede, on [0.1] satisfying the following two conditions: 


(i) [=> eee. 


(ii) Setting 
du = dy + (1 - |[y[)dé, 


there exists a function f in L*(Q,A) which is an isomorphism from 
(Q,A,P) onto [0,1] equipped with the completion with respect to yu of its 
Borel algebra. 


PROOF. Let Aj,..., Aq... denote the P-atoms of A. Since $7, P(Aa) < 1, 
the set of P-atoms is countable. Let cy = P(Ag) and let &j = i then the 
measure ¥ is well defined. If ||y|| = 1, the desired isomorphism is clear. If 
lvl] < 1, set 


Q Q — UrApg, 


A = AAQ, 


6 Theory of Differentiation 227 


and ee i 
PO Fa 


Applying Theorem 6.4.2 to (Q, A, P) shows that (ge L©(Q, A). Now let 


P(A) for AEA. 


f-F+ Do p14,0 


6.5 Conditional probabilities 


We would like to express conditional expectation by an integral. This has 
already been done in 2.4.2 in the case of conditioning relative to a finite 
sub-o-algebra. 


6.5.1 Theorem. Let (,.A,P) be a separable complete probability space, 
let (X,B) be a measure space, let f € Mp((Q, A); (X,B)), and let w= f,P 
be the distribution of f. For a o-algebra A’ on Q, let 1(Q, A’) be the set of 
probability measures defined on A’. 

Then there exist 


(i) ao-algebra A’ CA such that L(A’) = LB(A) and 
(it) a mapping x + v, from X to n(Q, A’) that is defined p1-a.e. 
and satisfies 


Blu(Flw))h(w)) = f u(erdula) ; hw)dva(e)| 


for any u € L&°(B), he L(A’). 


(The expression in brackets on the right-hand side is a function in L},(B).) 


PRoor. Using Theorem 6.4.2 on isomorphisms of probability spaces and 
noting that the “atomic set” appearing in 6.4.3 can be handled easily, 
we reduce the proof to the case where Q = [0,1], A is the o-algebra of 
Lebesgue-measurable sets, and P is Lebesgue measure. Taking A’ = Br, 
this reduction to [0,1] allows us to use the theory of Radon measures. 

Let gn = x", x € [0,1]. Let W denote the finite linear combinations of gn 
with rational coefficients, that is the polynomials with rational coefficients. 
For every w € W, the conditional expectation of w given f is defined in 
the complement of a p-negligible set. Taking a countable union of such 
negligible sets, we can find By € B such that u(Bo) = 0 and 


l,(w) = E(w(w)|f(w) = x) is defined Vx € Bo. 


Then 1, is a linear functional on the Q-vector space W. Since |l;(w)| < 
\|w||c, the Hahn-Banach theorem implies that 1, extends to a linear func- 
tional l/, defined on C([0, 1). 
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Hence, by II-5.2, there exists a Radon measure v, on [0,1] such that 
In(h) = f h(w)dvelw), Yh € C((0,1). 
In particular, 
E(u(w)|f(e) = 2) = f wl) dve(w). 


This formula extends by continuity from W to L¥(Q). 
Note finally that l,(w) > 0 if w > 0; whence v, is positive. Taking 
f =18 shows that v,(Q) =1.0 


6.6 Product of a countably infinite set of probability spaces 


Theorem. Let (Q,,An, Pn) be a countably infinite set of probability spaces. 
Then there exists a unique probability space (Q,.A, P) with the following two 
properties: 


(i) For every q, there exists a morphism from the product of the first q 
probability spaces (Qn,An, Pn) to (Q,.A, P). 


(ii) Furthermore, (Q,A, P) is the smallest probability space satisfying (i). 
More precisely, if (Q', A’, P’) is a probability space satisfying (i), then there 
exists a morphism of probability spaces ® : (Q', A’, P’) — (Q, A, P). 


PROOF. By the structure theorem (6.4.3), we can reduce the proof to the 
case where 2, = [0,1], A, is the Borel algebra, and P,, is a Radon measure 
ft, which is the sum of a discrete measure and a multiple of Lebesgue 
measure. Let 


= oi": 
Then Q is a compact space, which will be equipped with its Borel algebra. 
Define an injection 
Fq : [0,1]? — [0, 1] 
by setting 
Fae nets Ep) = (GeO 0). 
Let 
(fq)* (Hr @ M2 @ ++: ® Hq) = P,. 


Then FP, converges vaguely to a Radon measure Px, and (Q, Bo, Ps) is 
the desired probability space. O 
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Gaussian Sobolev Spaces 
and Stochastic Calculus 
of Variations 


Introduction 


In Chapter IV, we began by basing probability theory on the theory of 
abstract measure spaces of Chapter J. We then studied convergence in 
distribution by means of the Fourier transform on R?. Thus both abstract 
integration theory and classical analysis were necessary to obtain the limit 
theorems of probability theory. These two sources of Chapter IV derive 
from the dual nature of distributions. Although a distribution is attached 
to a very abstract object, a random variable on a probability space, it can 
also be thought of as given by a Radon measure on R. Borrowing an image 
from Plato, we might say that distributions have a daemonic nature: they 
come simultaneously from celestial objects (the abstract theory of measure 
spaces) and terrestrial objects (analysis on R). 

In this chapter, we study the “regularity of distributions”. The concept 
of regularity is based on the existence of a standard Radon measure on R, 
Lebesgue measure. A distribution is called regular if it has a density k with 
respect to Lebesgue measure, very regular if k is a C™ function, and so on. 
Lebesgue measure is defined in terrestrial terms as the translation-invariant 
Radon measure on R. 

To study the regularity of distributions, we will have to go up to the 
celestial level of quasi-invariant measures. A Gaussian probability space 
is a probability space equipped with a sequence of independent Gaus- 
sian random variables that generates the underlying o-algebra. On such 
a space, the probability measure is quasi-invariant under the action of 
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distinguished translations, those of Cameron-Martin. The action of trans- 
lations on L?(R“) led in Chapter III to the definition of the Sobolev spaces 
A(R"). 

Proceeding similarly here, we will define further celestial objects, spaces 
of infinitely differentiable random variables. We can then use differential 
calculus on both R and the probability space. The interaction, through 
a random variable, of these two kinds of differential calculus will make it 
possible to study the regularity of distributions. 

The use on an abstract probability space of a natural underlying differ- 
ential structure, as developed here, is commonly called “stochastic calculus 
of variations” . 


1 Gaussian Probability Spaces 


1.1 Definition. Let (Q,A,P) be a probability space and let X be an R"- 
valued random variable defined on Q. X is called a Gaussian random vari- 
able if the distribution of X is a Gaussian measure on R”. (See IV-4.3.3.) 
Gaussian measures and Gaussian random variables are sometimes called 
normal. 


REMARK. If X is Gaussian, X is in L? Vp < +00. 


1.2 Definition. Let (Q,A,P) be a probability space and let {X,,} be a 

sequence of independent normal random variables. (Q,.A, P) is said to be 

a Gaussian space if the o-algebra generated by all the X,, is equal to A. 
We intend to construct a basis for L?(0,.A, P). 


1.8 Hermite polynomials 


. . ty 22 i 
On R, we define the Gaussian measure 1;(A) = ty exp(-4) ee. 


1.3.1 On L?(R, 1), we consider the scalar product 


Com [ eow(an, 


the differentiation operator d = a and the operator 6 defined by 


dy wy a any? 
by(a) = a a —e" oe (© - 29) : 
1.3.2 Lemma. When vy and y) are C! functions with compact support, 


(2) (dp|w) = (ploy). 
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(ii) Moreover, (i) remains true if p and w are absolutely continuous and 
in L?(R,v), with dp and 6p in L?. 


(itt) dé — 6d = Identity. 


PROOF. (i) follows from an integration by parts, and (ii) from approximat- 
ing y and w by compactly supported C?! functions. A different proof will 
be given later, in 2.2.3. 0 


1.3.3 Definition. The Hermite polynomials are defined by setting Ho = 1 
and H,, = 6"1 for n > 1. Here 6” = 60...06, n times. It is immediate that 


Ay, = él=a, 
Hy, = 661=27—1, and 
Hy, = ®1=23 —3z. 


1.3.4 Proposition. H,, is a polynomial of degree n whose highest-degree 
term is x”. The following relations hold: 


(i) 6H, = An+13 
(it) dH, = nHn-1;3 
(iit) (6+d)H, = Hy; 
(iv) 6dH, = ndHpy. 
PROOF. 


(i) follows immediately from the definition. 
(ii) is proved by induction, using 1.3.2.(iii): 


dH, = d 6Hy-1 = 6 dHy-1 + Hp—1 = (n— 1)6Hn_2 + Hn-1- 


(iii) follows from the definition of the operator 6 (1.3.1). 
(iv) follows from (ii) and the definition of H,. 0 


1.3.5 Corollary. Let F(g(x))(€) = ite e§* 9(x)dx be the Fourier trans- 


form of g at the point €. Then F(H,(x)e7® /2) = iméte © /?, 
PROOF. 


FH 1eH-V/?)(E) = (el |6"1) = (dr e**|1) = 1 E"(0'*|1) 


+ 
nen | OY pige—a?/2 Ax = rere © 7/2 0 
as V2n 
oo 


1.3.6 Theorem. {att} is an orthonormal basis of L?(R,11). 


(n!)2 
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PRooF. (i) We show that the polynomials H,, are dense in L?(R, v,). Oth- 
erwise there would exist y € L* such that (y|z*) = 0 Vk = 0,1,... 
Let 


F(t) = / pvett?—”*/2dy, 


Setting t =o + iT, we have 


v7? /2.-7 2-7/2 es —2rv,—v?/2 a 
lp(v)ije7” “eT ?du < lp(v)|?e7” /2dv e “Te? /*dyu ; 


Thus I-7.8.4 can be used to differentiate under the integral sign, showing 
that F is an entire function of t. Since 


F®) (0) = i® f ukp(ujert Pav =0 


for every k, F = 0. Applying the inverse Fourier transform, we see that 
yp=0. 


(ii) The polynomials H;, are linearly independent since the coefficient of x* 
in Ay is 1. 


iii) We show that the functions = 43H; form an orthonormal system. If 
(kI)I7 
s>k, then 
(Hy,|Hs) = (d°6*1|1) = 0, 


since d°6*1 = 0. If s = k, then d*6*1 is the product of s! and the coefficient 
of the highest-degree term of H,; that is, d°651 = s!. 0 


1.4 Hermite series expansion 


1.4.1 Theorem. Let g be a C™ function on R such that g and all its 
derivatives are in L?(R,14). The expansion of g with respect to the basis 
mnie Hn 18 

x0 


g(2) = > Bly Hy(2), 


n=0 ~ 
where E(g') = (g'™|1) and g'”) is the nth-order derivative of g (g) = q). 


Proor. Let g(x) = 7.9 Cn Hn(a) be the Hermite series expansion of g. 
Integrating term by term and using the orthogonality of the polynomials 
Ay, we have 


ne ex 
Ely} 


Moreover, (Hx|g) = (6*1\g) = (|d*g). Hence Cy = 


kl 
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n=0 nl 


1.4.2 EXAMPLE. exp(—& + tz) = 0, 4 HAn(z). 
PROOF. 


and Sas i 5 
t x dz 
t” exp| —— + tax } ex (-$) aaa gen oS 
ye r( 2 ) PS han 


1.4.3 Corollary. 2-2 = ori ty 27+) exp(—= + zx)dz, where y is a 


n! 
simple closed curve around the origin in C. 


ProoF. This follows from 1.4.2 and the Cauchy formula. 0 


1.5 The Ornstein-Uhlenbeck operator on R 


1.5.1 Definition. £ = éd = os + ct is called the Ornstein-Uhlenbeck 
operator on R. 


1.5.2 Lemma. CH, = n4H,,. 
ProorF. By 1.3.4(iv). 0 
1.5.3 Definition. Let Ps be the operator defined by 
oe 2yq dx 
Pafty) = | f(zcos@+ysin6)e * es 
CO V 


Qn 


REMARK. The integral above takes the same value for 6 and a — 0; it 
depends only on sin @. 
1.5.4 Proposition. 

(i) (Poylb) = (yl Pov); 

(tt) dP9 = sin 0P9d; 

(itt) Pod = sin @ 6Po; 

(iv) LPo = Pol; 

(v) PoH, = (sin 6)" Hp. 


Proor. The measure exp(— $i) ed is rotation invariant. (i) follows 
from this; (ii) is immediate; (iii) follows from (i) and (ii) and the fact that 
(6el) = (eldw). (i) and (ii) imply (iv): 

(v) is proved by alternately using (iii) and the fact that 6H, = An41: 


PoHn = Pp6Hy—1 = sin 6Pp Hn) = sinO 6Pe6Hy_2 = sin? 6 6° P9Hp_2. 
Iterating this gives 
(sin 0)"6" Po Ho = (sin8)"6"1 = (sin 8)" H,.O 


1.5.5 Proposition. Let 0(t) = arcsin(e~'), where t > 0. Then 
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(i) (Poa f) = -L( Poa f), and 
(tt) Pace) 0 Por) = Pocesey)- 


PROOF. Since Pg depends only on sin 6, we can set 6(t) = arcsin(e~*). Then 
d 729 en ®/2 dy 
— (P, = ~—a sin 6 + ycos 6) f'(x cos 6 + ysin @) ——— 
aa (Pef)(y) Ls ( y cos 6) f(x ysin @) Tix 
= ycosOPe(df)(y) — sin 6 cos @Po(d? f)(y), 


where the second term comes from an integration by parts. Using 1.5.4(ii), 


d _ 2 cos @ 
ao POF hy) = (ydPo f(y) —d Pof(y)) 5: 
Since df = — 33g, 
d d dé 
a Paw) = wo Pela = —LPayyf. 


This proves (i). 

We now prove (ii). By 1.5.4(v), Pg o Py H, = sin(n@) sin(n6’)H,. Since 
sin 0(t) sin @(t’) = sin O(t + t’) = e~ t+"), this implies (ii) for finite linear 
combinations of Hermite polynomials and hence, passing to the limit, for 
L?, 


1.5.6 Lemma. (Po f)(y) = oe f(z) Kola, y)e-* /? se, where 
4 1 2ry sin 6 — sin? O(a? + y?) 
aay) | cos 6| ae 2cos? 6 , 


PROOF. This follows from the change of variables u = x cos@ + ysin@ in 
1.5.3. O 


1.5.7 REMARK. Since the operator Po is self-adjoint with respect to the 
scalar product (see 1.5.4(i)), the kernel Kg is symmetric in x and y. 


1.5.8 Examples of expansion in Hermite series 


(i) H,(xcos@ + ysin@) = Dae (;) (cos @)?(sin 0)"~? H, (x) An—p(y) 
Gi) ial og (in )" Hn (2) Hn(y) = exp(sin§ 6,62)1(x)1(y), where 


d? d a? d 
by are +2@a, and 62 = —qa t+ Yq: 
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PROOF. 


_ (cos 9)? n! 


(i) (£)?H,,(zcos6 + ysin 6) Tapp An-p(z cos 6 + ysin 6) if 
p <n. To evaluate P9Hn_», use 1.5.4(v) followed by Theorem 
1.4.1. 

(ii) Expand y+ Ko(a,y) in a Hermite series, using Theorem 1.4.1: 


oe d\" 2, dy Tee 275 dy 
— | Ko(z, VP = Ko(a,y)(6"1)e—¥ (7 —— 
x (ai) a ale gl age OHO pe 


+00 


_y2/q dy 
Ko(2,y)Hn ve, 
aes (x y) (y)e Vin 


By 1.5.4(v), this equals (sin @)"H,,(z). O 


1.6 Canonical basis for the L? space 
of a Gaussian probability space 


1.6.1 Notation. Let RN be the set of real-valued sequences and let B,, 
be the Borel algebra on RN. Projection onto the first n coordinates is 
denoted by 7, : RN — R”. It follows from the structure theorem (IV-6.6) 
that there exists a measure v on RN such that the direct image (1,)4V of v 


2 
. 22/2 
under 7 satisfies (1™)«/ = Un, where vp, = [Ti_, ¢ 7a, | 42. By denotes 


the inverse image under 7, of By. 


1.6.2 Proposition. The increasing sequence {B,} of o-algebras is a fil- 
tration of the space (RN, Boo, V). 

The space (RN, B..,v) is a@ Gaussian probability space and B,, is the 
a-algebra generated by the Gaussian variables X,, of projection onto the 
nth coordinate. 


PROOF. Follows from the definitions. 0 


1.6.3 Proposition. Let f € L?(Q,A,P) There exists f : RN > R such 
that 


fla) =f OW) Kae Myla) ds 
PROOF. By Dynkin’s theorem, IV-1.5.4. 


1.6.4 Lemma. /f (Q,A, P) is a Gaussian space and {X,} is a sequence of 
Gaussian random variables that generates A, then X, € L??(Q,A, P) for 
1l<p<o. 


PROOF. The integral i: ee ® 2 de converges. O 


1.6.5 Definition. Let € be the set of sequences of integers (ni, n2,...,0,...) 
for which all but finitely many terms are zero. For p = (n1,...,nx,0,.. je 
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Eandw €Q, let 
k 
Hp(w) = [] Hn, (Xi(w)), 
i=1 
where H,,, are the Hermite polynomials on R. We write p! = Te ni). 
1.6.6 Theorem. {(p!)~!/?H,(w)} is an orthonormal basis of L?(Q, A, P). 


PRoor. We prove the theorem for R?. By IV-3.5.1, linear combinations 
of the form f = >> fihi, where fi(&1), hi(€2) € L?(R,1), are dense in 
L?(R?, v2). Approximating the functions f;, h; by their expansions in Her- 
mite polynomials shows that the set of functions of the form 


Ar, (61) Ak, (£2) 


2 4 
generates L?(R?, exp(—St& +8 )s 4 ). 
Moreover, 


(Fas (€1) Hi (6) (Es) Hes (a)) = ep 8(815 Fx )(825Fa)-© 


1.6.7 Theorem (Taylor-Stroock formula). Set 
E(d,f) = [ovar 1.8% Far, ..-5te)ve(de) 
for p =(n1,7n2,...,7K,0,...,0,...). If E(Opf) exists for every p, then 


(w= = E(pf)Hplw). 
pe€é . 


PROOF. It suffices to prove 1.6.7 when 2 = R” and f : R” — R; that is, 


. 1 Z 
ficial. pie Ops) p(t. --12n)- 

pe€ 

The proof proceeds as in 1.4.1. 0 


1.7 Isomorphism theorem. There exists an isomorphism yp between 
L?(Q, A, P) and L?(RN, Boo, v). 


1.8 The Cameron-Martin theorem on (R‘, Boo, v): 
quasi-invariance under the action of ¢? 
1.8.1 Proposition. Let (0,4, P) be a probability space and let {M,} be 


a sequence of integrable random variables such that sup, E(|Mn|?) = Cp < 
+oo forp>1. Then {M,} is uniformly integrable. 
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ProoF. [4 |MnidP < ||Mn\lze[Valloa < CpP(A)!/? for A € A (cf. IV- 
wae anal 


In what follows, we consider the Gaussian probability space (RN, Bo,V) 
(cf. 1.6.1 and 1.6.2). 


1.8.2 Notation. Let @? be the space of sequences x = (11,...,2n,-- Ee 
RN such that 0%, 2? < too. The scalar product (zly) = 7%, ey: is 
associated with the norm |a| = \/)> 2? on @?. Let Ty : RN — RN denote 
the mapping defined by 7,(z) = x + y and let (7,).v denote the image of 


the measure v under Ty. (See IV-1.4.3.) 


1.8.3 Theorem (Cameron-Martin). If y € ¢, then the image measure 
(Ty)«Y is absolutely continuous with respect to v and the density is given by 


ACy)Y) 2) — 00 (3 Ykezk — 5D] : 
k=1 


k=1 
PRoor. Let Sp(z) = op, Yeek- 


(i) The sequence {S,,} on (RN, B,.,v) is an L? martingale relative 
to the filtration {B,}. Hence {S,,} converges a.s. Let S.. = 
limp—oo Sn. Then Sa < +00 as. 

(ii) The a.s. convergence of S,, implies its convergence in distribu- 
tion. This follows from IV-1.8.5. 

(ili) Set My (z) = exp(S haa YRzm — pba Yi) 


The sequence {M,,} is a martingale relative to the filtration {B,,} and, 
for all n, E(M,) = 1. By Fatou’s theorem (IV-5.6.3), the limit M. = 
limn soo My exists a.s. 


(iv) E(M?) =E 


n n 
1 
exp (Spm ara > )| . 
k=1 k=1 
But 
E 


. 1 2\|_ 
exp (Seema 5P S “) =1. 
Hence 

1 n 
Pp) — Bp sd 2 
E[M?| oo( Jt :p) Sf] ¢ 


M,, is therefore bounded in L? and hence uniformly integrable. By IV- 
5.8, M,, converges in L? to M,,. Thus M,, is the conditional expectation 
of M,,. Given a function f depending on the first r coordinates, we have 


E((ty).f) = E(M,f) = E(Mof). 
The equality E(M, f) = E(M..f) extends by continuity to all f in L?. 0 
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REMARK. Although v is not invariant under the Cameron-Martin trans- 
lations T,, the measure (7,).” is absolutely continuous with respect to v. 
This property of (7,)«1 is called quasi-invariance. 


2 Gaussian Sobolev Spaces 


2.1 Finite-dimensional spaces 
2.1.1 Notation 
Let f € L?(R",1,). We write E(f) = f f(ai,...,a~)dvy. By 1.5.8(iii), f 


can be expanded in a Hermite series. If p = (pi.....pe) € N*, we set 
p! = py!p2!...px! and |p| = pi + po +... + De: 
Then 


(cues Ss tg) Oe 


Pi---Pk p! 
Let 
of ahha Cd aatevly ie) ree 
Then 
lian = So Moromllisin = 30 oa 
pit Di --Dk 


For a C? function ¢ : R* — R, we have the partial differential operators 
O 
Oye = a and 6;9 = (—0; + xj). 


2.1.2 Operators on L?(R*) 


An operator T defined on the polynomials can be extended to a formal 
operator on L?(R*) as follows: for f € L? (vx), let 


Ee as SS Cpr. m5; T[Hp, (11)... Hp, (tx))- 


P1---Pk 


The domain of T consists of those f € L? such that Tf € L?. 
Restricting our attention to differential operators, we consider 0;, 6;, and 


L= Sj -1 60;. Let 


.. Hy, (Lk 
(%) 2 2 biOst = > Cpy.. pe ( fjp| meet) 
Pk 
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and let the gradient operator be defined by Vf = (0, f,...,Oxf). For z = 
(21,-+-52j,+++52k), With 2; € L?(v%), j =1,...,k, we set 


k 
j=l 


When 0;f € L*(v%) for j = 1,...,k, we set 


i 1/2 
(iit) IV fll = SN 


j=l 


Similarly, if 0;,...0),f € L?(ve) Vj1,...,Jjs, we set 


1/2 
(iv) IVF = | DARA], 
ne 1/2 

[Ver = 1X ORO y 


Siyeds 


We intend to determine the domains of the operators V, V?, and CL; that 
is, the set of functions f € L?(v,%) whose images under these operators are 
in L?(v,). Recall that the Sobolev space W?,,. was defined in III-3.4.3. 


loc 


2.1.3 Definition. 


s=0 


D?(R*) = {! € Wrioc(RR*) : > IV fllzew.) < too 


2.1.4 Theorem. D?(R*), with the norm ||f\|pe = Theo IV fllaeu,): és 
a complete space. 


ProorF. By III-3.3.6. 0 


2.2 Using Hermite series to characterize D?(R) 
in the Gaussian L? space 


Let f €¢ L? = L?(R,) and let Py en(f) 2 = 9 fn be its Hermite 


ni 
series expansion. 
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2.2.1 Definition. On L?, the formal operators d, 6, and £ are defined by 


df =Srenl SE = Sena 22s 
n>1 z n>0 ee a 
= , n+l n+l. 
of Lent) = gp ea jr iyl 
> H, > 
Li=) oN Gop 


n>1 
The reader can easily verify that £ = 6d. 
2.2.2 Lemma. 
(i) [Cache a nll frllz2- 
For an integer s > 1, 
(tt) en(d°f) = eno s(f); 
(iit) cn (O° f) = 0 ifn <8 and cn (6° f) = GAjen—s(f) ifn 2 8; 
(tv) Cn({L?f) =n en(f). 
PROOF. (i) follows from 1.3.6, since fn = cn(f) Hn 


The other identities follow from the definitions and from the relations 
dH, = nH,_, and 6H, = An41. 0 


2.2.3 Proposition. Let f € L? and let DVnso dn be its Hermite series 
expansion. Then for s an integer, s > 1, properties (i) through (iv) below 
are equivalent. 


(7) wPer?, 
(ii) Son |lfalli2 < +00. 
(iii)  feD?(R). 
(iv) OPE: 


In particular, D?2(R) is the domain of the operators d®° and 6° on L?. If 
both f and g are in D{(R), then 


(v) (dflg)r2 = (flég) ce. 


Proor. (i) > (ii). d? f = 1,50 Cn¢s(f) 2 since dH, = nHy_1. Hence, by 
2.2.2, 7 


»\1 
la fI22 = Do lense = oO asl? 


n>0 : n2=0 


This proves (ii), since (nts)! ~(n+s)S asn— o. 
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(ii) = (i). If (ii) holds, the series 7.5 Cn+s(f) converges in L? and 
d°f € L?. 


(iv) > (ii). By 2.2.1, 6 Ff = Dysoen(f)(m + I(r t 2)...(n + s) eats. 
Hence, by 2.2.2, 


OFRe = len PetVert)+s) 
n>0 

= Si(n+1)(n+2)...(n+s)llfall?. 
n>0 


(ii) = (iii). We give the proof for the case s = 1. Let y € D; then 


fee@de = DY fe @eatnt a 
n>0 
= => foe a 
n>0 
= =f o(eafte)ar 


Hence f € W?,,.- Since df € L?, it follows that f € D7(R). 


(iii) = (ii). As above, we give the proof only when s = 1. Let df be the 
weak derivative of f. Then df € L? and df = 0,35 cn (af) 2 for y € D, 
and hence ~ 


[e@iteoas=— f elo arteae. 
This implies that c,+1(f) = en(df), and (ii) follows. 
(v) is proved by using the orthogonality of the Hermite polynomials. 0 


2.2.4 Proposition. [f s = 2p, then (i), (ti), (iii), and (iv) of 2.2.8 are 
equivalent to 


(vi) Lifer. 


PROOF. We verify only that (ii) = (vi). Since Lf = V7 59 en(f)n#2, 


n: 
2 gn? 2 2 
IFll2 = So emf)? = Do nil fall? 
n>0 “— n>0 
This implies equivalence when s = 1. The proof for s > 1 is similar. 0 


2.2.5 Lemma. The following identities hold: 


(@) Sf llz2 = lidfllz2 + Wf lize 
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and if k is an integer, k > 1, 
(ii) Jd Of lz = Ila" fiZe + (2k + Dla fliz2 + RPE" Fiz. 
PROOF OF (i). 


lof llz2 = So (e+ Dlfallie = Wfollze + Sor + 2)Ilfrsallze, 


n>0 n>0 


lldfllz2 = Sot Vil faa lize 


n>0 


Hence 


Sfllz2 — lldflliZ2 = Ufollze + D> Wfnsallde = Wf llze- 


n>0 


PROOF OF (ii). Since en 41(6f) = (n+ ae and |en(f)|? = eae 


Id*oflli2 = y leeexl éf)P x = = eae 1( (n+ hPa 
n>0 n>0 
n+k)! 
SiH K)ifnee—ill?. 
n>0 : 
Since 


yatt pe, = SER al, 


n! 
n>O 
we can compute ||d*6f||7. — ||\d**! ||. by observing that 


(ntk+2)\n+k+2) (n+k+1)! (n+k+1)! 
(n +2)! n! (n+ 2)! 


[(2k+1)n+k?+4k+2]. 


2.2.6 Lemma (Differentiation of composite functions). Let g € 
Di(R) and let w = gu be the image of 1 under g. If yp € L*() and 
dy € L'(p), then gog € Di(R) and d(y 0 g)(x) = (dp)(9(x))dg(2). 
PrRooF. d is the extension of the differentiation operator 4 are By Hélder’s 
inequality, if f; ¢ L4 and fo € L4, then fi fo € L?. 0 


2.2.7 Lemma. /f f € D/(R) and g € D{(R), then fg € D7(R) and 


(i) d(fg) = fag + gdf. 
If f © D5(R) and g € D3(R), then fg € D3(R) and 
(it) L( fg) =L(f)g + L(g) f + df dg. 


PROOF. (i) and (ii) follow from identities obtained when f and g are dif- 
ferentiable, since d (respectively £) is the extension of the operator a 
(respectively 6d — see 1.5.1). Hélder’s inequality implies that d(fg) € L 
and £(fg)€ L?.0 
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2.38 The spaces D?(R*) (k > 1) 


2.3.1 Proposition. Let f € L?(vx) and let f = Yo, oy frr.pe be tts 
Hermite series expansion. Then the following statements are equivalent: 


(i) IV fll € L°(R*, vx). 

(ii) Yo Ililfpr.rellz20n) < +90. 
Pi---Pk 

(a) f €D?(R*). 


ProoF. Note that 
Cp ...Pj—-1PjPj41--Pk (f) = Cp) ...Pj-1Pj—-1Pj41--Pk (O;f). 
Hence 


lexaecnr 
IFIBeoy = So vee = Pll 


P1-+-Pk P1--:Pk 
(i) < (ii) then follows from the relation ||Vf||Z2 = i lO;fllzea,): For 
(ii) = (iii), see Proposition 2.2.3. O 


2.3.2 Proposition. Let f = ae foy..p, be the Hermite series expan- 
sion of f € L?(vz). The following properties are equivalent: 


(7) Lf € L? (uy). 

(it) >= Iw? life» ll? < +00. 
P1---Dk 

(iit) f €D3(R*).o 


PRooF. See Proposition 2.2.3. 0 
2.3.3 Definition. Let Vif = (0;, 05, hee Opt igi case 
If 0;,0;, ...0;,f € L* (vm) for every j1..., Jr, then 
IV" flliZe,) = x |9;, --- OF llZ2a,)- 
Sidr 


2.3.4 Proposition. If f € D3(R*), then 
Lf liz2 = IV? fllt2 + IVF llz2- 


PROOF. It suffices to check the formula for differentiable functions. In this 
case, £ = )), 6;0;, where 6; = —0? + 250). 


Hence 

ICfilz2= > 6 Of Of) = D— 18), Of lle + ye lO; fllz2, 
mae yes : Jide 
jo=l,..., 


where we have used 1.3.2(i) and (ii). 0 
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2.4 Approzimation of L?(R,v) by L?(R",v) 


Let X;: RN — R denote projection onto the ith coordinate and let B, be 
the o-algebra generated by the random variables {X;}1<i<n. Then {Bn}n 
is a filtration (see 1.6.2). 


2.4.1 Lemma. Let f €¢ L?(RN.v). Then 


() fr = EP (f) is a martingale in L?(RN,v) and 

(tt) limn—oo || fn — FllropRN) = 0. Thus f is the final value of a 
martingale in L?(RN) relative to the filtration {By} (see IV- 
5.5.2). 


PROOF. 


(i) By IV-5.1.5. 
(ii) By V-1.8.1 and IV-5.8.1. 0 


2.5 The spaces D?(R‘) 


Next, starting with the Gaussian Sobolev spaces D?(R*) on the finite- 
dimensional space R* and using the martingale approximation of Lemma 
2.4.1, we will study the Gaussian spaces D?(RN). If f ¢ L?(RN,v), the 


function f, = EB (f) depends only on the first n variables: 


fn(L) = Gn(X1(z),...,Xn(£)) = Yn(21,---,2n). 


2.5.1 Definition. We say that f € D?(RN) if f, = EB»(f) ¢ D2(RN) for 
all n and sup,, Il frllpe~RN) < +00. 
In this case, we set || f||pe¢N) = SUPn Il fnllpecRN): 


2.5.2 Operators on L?(R‘,v) 
Let f € L?(R,v) and let its Hermite series decomposition (see 1.6.7) be 


k 
fa) = 0 seo) [1H (ai), 
pe€é : i=1 


where p = (pi,...,pr,0,...,0). 
As in 2.1.2, we set 


k 
Osa = >. sent l)2 (11 Hine) 


pee i=l 
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and 
Lhe) = Do Golf) se (I e)). 
pe€ 
2.5.3 Lemma. Let f € Di(RN). Then 
(i ajBPn(f) = BP (o,f) if J <n: 
(it) 0,EB-(f) =0 if j>n. 


Proor. Let f € L?(RN,v) and let its Hermite series expansion be 


ee Cy, lS Hy, (a1) «++ Hp, (tr). 


gerry 


Then 
> Cp,...px (Ff) Hn (x1)... Hp, (Tk), 


P1,---sPk 
1<k<n 


since the variables X; are independent and EB (H,(xp)) =Oifk>n. (i) 
and (ii) follow immediately. 0 


2.5.4 Lemma. Let f € D2(RN) and let fy, = E2"(f). Then, fork EN, 
the sequence {Okfn}neN converges in L?(RN) and 


(i) Of = lim OBB» (f), 


ProoF. Let k be fixed. Then {0;f,} is a martingale by 2.5.3(i); by 2.5.1 
it is an L? martingale, which converges by IV-5.3.3 

To prove (i), note that 0, f — limy Ox fn is a continuous linear map from 
D?(RN) to L?(RN) which vanishes on the set of functions depending on a 
finite number of coordinates. 0 


2.5.5 Lemma. Let f € D2(RN). Then Lf = limp. EB (Lf). 
PRoor. Check that EB (Lf) = cEP«(f) on the Hermite series decompo- 
sition of f. 0 


2.5.6 Theorem. Let f € L?(RN pv). Then the following statements are 
equivalent: 

(i) f €DU(RN). 

(ii) For every k, k > 1, Of € L?(RN,v) and YO, lO f\lZ2 < +00. 


Furthermore, the space D?(R) is complete in the metric given by the 


norm |lfllpz = lifllz2 + Ue llOefllz2- 
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PROOF. (i) = (ii): By 2.5.1, 


nm 
IFallse = WFnllz2 + D2 Oj Fallz2 Se, 
j=1 
where c is a constant independent of n. 
Thus $771 lO; fnllZ2 < ¢ for p < n. As n — ov, this inequality persists: 
ae lO; f\lZ2 <¢. Letting p — oo gives (ii). 
(ii) = (i): The same procedure as for (i) = (ii). 
To show that D7(RN) is complete, let {f'?} be a Cauchy sequence in 
D?(RN) and set fi” = EB» (f). Then 


k k 
ee ae ce ‘IIb aN) < pe = i ‘llpaRN): 


Since D7(R”) is complete, the sequence { fO Veen converges in D?(R") to 
fn- It is straightforward to show that {f,}nen is a martingale associated 
with the filtration {B,,}; it converges to f € D?(RN). o 


3 Absolute Continuity of Distributions 


3.1 The Gaussian Space on R 


Let g: R > R, g € L?(R). We seek sufficient conditions on g for the direct 
image measure g,V/,; to be absolutely continuous with respect to Lebesgue 
measure on R. 


3.1.1 Lemma. Let @ be a finite positive Borel measure on R. Suppose that, 
for every yp which is C! and bounded on R, 


(i) | i aa) < esup lel) 


Then @ is absolutely continuous with respect to Lebesgue measure d&é on R, 
and its density k is in L*(d€) and satisfies 


[as <cO(R) and k(€)<cdé ae. 


PROOF. Let y be a bounded increasing C! function such that y(—oo) = 0. 


Then 9(€) = f*.. o'(u)du. 
It follows from (i) that, for every nonnegative continuous function y’, 


(ii) [eae se [ewe 


oO 


(ii) extends to nonnegative Borel functions. 
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Hence, for every nonnegative Borel function, f fdu = 0 implies 6(f) = 0. 
By the Radon-Nikodym theorem (IV-6.2.1), 6 is absolutely continuous with 
respect to Lebesgue measure. Let k(£) = ae Then k(€) > 0 and, since 


| / o' (ERE) < esup lol) 


k(€) <cd€ ae. 
It remains to check that the density k = = is in L?(dé). This follows 
from the inequality 


[Pe se f keds = oR).0 


3.1.2 Corollary. If g € L?(1), ||dg||~! € L?(™), and 5(az) € L'(), 
then = gs; 18 absolutely continuous with respect to Lebesgue measure, 
and its density k = du/d& is in L?(dé). 


PROOF. 
/ J @d(gerr)(0) = J; g'(al2))dry = (v'(o(2))I1) 
and 


(y'(g(x))|1) = / d(pog)(a) x roe 


(v2 918a5)) & supecn lotO)If [OG) dv. 


The result follows from Lemma 3.1.1. 0 


3.1.3 Let g € D3(R), the Sobolev space of order 2. Let A = {m7 ER: 
dg(x) # 0}, let 14 denote the indicator function of A, and let. 1,41 be the 
density measure 1,4 with respect to 1. 


3.1.4 Theorem. The image measure g.1,4v; is absolutely continuous with 
respect to Lebesque measure. 


dg(x) ; 
ProorF. Let f(x) = T+ (dg(ay?" Since g € D3(R), we have dg € D?(R) 
and éf € L?(R). Let A. = {x : dg(x) > €}. When x € A, and0 <e <1, 
dg(x)f(2) > 5. 
Let ~% be a nonnegative function defined on R. Then 


/ (ge (doa) f(a)v1) = if (Wo g())dg(x)f («)dv1(x) 
e2 e2 
5 | Weatwdnte) =F [ve lan(ta.n) 


IV 
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Suppose that p4 = g.(141) is not absolutely continuous with respect to 
Lebesgue measure; then there exists a compact subset K of R such that 
Jie UE = 0 and pa(K) > 0. 

Since A = UnAt/ns 


palK) = f 1x(ale))Lale)dnr(x) = fim. f 1x (g(x) Lay (eave) > 0. 


Hence there exists ¢ such that p4,() > 0. Let {un} be a sequence of 
continuous functions on R such that (i) 0 <u, < 1, (ii) limps tun(E) = 
1 (€), and (iii) for some R, un(€) = 0 if |€| > R. 

Set gn (€) = fe pUn(A)dX. Then, by the dominated convergence theorem, 


€ 
dim en(é) = f de Ad =0. 


Moreover, 
/ e ‘ 
(en e916) = (en egddglt) > (09) Aas. 


Since (yy, 0 gid f) > 0 and f yi,(E)p4.(dé) > pa.(K), this gives a contra- 
diction. 0 


3.2 The Gaussian space on R® 


Let g = (g1,---,9a) € L?(RN, v) be a function with values in R72. We now 
seek sufficient conditions for the direct image measure g,v to be absolutely 
continuous with respect to Lebesgue measure on R”. 


3.2.1 Notation. If g = (g1,...,ga) is such that gg € D?(RN) for k = 
1,...,d, we set 


V Ge = (Oign OnGns 1s CL OF G Ris): 
By 2.5.6, 7551 llOj9nllZ2,) < +00 
3.2.2 Lemma. /f 


CO [oe] 
Y_ WOjgellZ20) < +00 and S~ OjgpllZ2,) < +00, 
j=l j=l 


then 
S- (Oj 9419; 9p) 220) < +00. 


j=l 


PROOF. This follows immediately from Hélder’s inequality. 0 
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3.2.3 Notation. We set 


(i) |Vgx|2(x) = Sao (x), 


(ii) (Vgn]V 9p) (x) = 5 8j0e' £)059p(2). 
g=1, 


The series (i) and (ii) are convergent in L}(v). 


3.2.4 Definition. The matrix oi, = ((VgilVg9n)(£)) i= d is called the 


covariance matrix associated with g. 


3.2.5 Lemma. Jf g = (g1,..-,9a) : RN — R? and, for k = 1,...,d, 
gr € D?(RN), then 
Oik E LRN v). 


PROOF. This follows from Holder’s inequality. 0 
3.2.6 Notation. Let g = (g1,..-,9a) : RN —, R?. Suppose that g; € 


D?(RN) for i = 1,...,d and that the inverse matrix o;;,) exists v-a.e. We 
set 
: d 4 
(i) 25k = iar im OG, 
(ii) Zk (21k; 22k sss Zikyes »: 


Lemma. /f g; € Di(RN) fori = 1,...,d and oj! € L4(RN), then 
y; IlzsellZ2) < +00 and, for every C! function py: R? > R, 


(i) YA (eo)a)ennle) = S (ate) 
J 
PROOF. 0;(yY0 g)(z) = 1 ae, (9 9(£))0;9p(x), and hence 


d;(~ og) (2) zjx(x ox (at «)) Soe 59:05 9p. 


This implies (i). 0 


3.2.7 Definition. When g; € D3(RN) fori =1,...,d and a;,' € Di(RN), 


we set 
d d 


bz. =D (Laon — D(VailV(oR))- 


i=1 i=1 


3.2.8 Theorem. For every function  € D7(RN), 


S > (O;vlzi6) 220) = (~|bzx) r2(v)- 


j 
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Proor. Yo, (Ojt|2x) 22 = (uI we 5)(0;4'0}91)) and, recalling that 0; = 
a and 6; = = + Xj, one can easily show that 

6;(fO;9i) = £bj;9; 9: — Oj fO;9:- 
The relation follows by summing over j. 


3.2.9 Proposition. Let py: R? — R and let g be such that g; € D3(RN) 
fori=1,...,d and oj’ € Di(RN). Then 


Op _ 
I. HE, Uo) = fw 0 g)(x)bz4(a)dv(x). 


aes 


Fe (6dla.r) = f SE (ala))av(e) = (2,(e29) leu) 21) = (voglden) 


J 


R! ve 


The last equality follows from 3.2.8. 


3.2.10 Lemma. Let 6 be a finite measure on R°. Suppose that there exists 
a constant C’ such that 


for every bounded C' function y on R‘. 
Then 6 is absolutely continuous with respect to Lebesgue measure dé on 
Rk’, 
PROOF. For the case d = 1, see 3.1.1. We prove the lemma when d = 2. 
Let y~ be a compactly supported C! function on R?. We first show that 


1/2 
i 2 Z i 09 | in 7 dy 
(ii) Gia Fa dryads) < 5 oe dxydxz + Bey 


To see this, let 
v(z1) = sup |y(x1,72)| and w(x2) = sup |p(r1, r2)|. 
r2ER x,E€R 


7) Srp a atic 
€eR?¢ 


ards, : 


Then 
(iti) a, \yl?da dary < [oenaes f w(ea)ars, 
R2 
Since 
+00 | ¢ 
; 0 
(iv) v(ai) < I. 5 (ars) dx2 
and 
+00 ra) 
(v) w(x) < fo |S (ar) dxy, 


(ii) is proved. 
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Let u be a nonnegative continuous function with compact support such 
that 


(vt) ae u(@1,22)dx\dzrq = 1. 


For € > 0, we set u.(z) = 5u(#) and 


Yer) = ie Ue(x — A)O(dA). 


For every continuous compactly supported function y, 


Ihe (= \2 = (z)dx 0(dd) 


. i a u(2)h(A + ex)dz 6(dd), 


I 


| pela)b(a)de 
R2 


Since w is continuous, ~(A + €z) tends to (A) as € — 0. It follows that 


(vii) lim ‘| pel) b(a)de = i (0) (dd) 


The measures (x) thus converge vaguely to 6(dx) as « > 0. 


If w is C!, 
Oe = OW 
i “de = | [ eghae 


It follows from (i) and (vii) that 


Oe 
wdx| <C sup |w(A)|. 
[3 sup [¥O)] 
Hence 
te dy. |” 
tee Gi 
(viit) i. Dn, dx <C 


For every €, by Holder’s inequality, 


| [ vas < | f iF] [wr “ 
fons e( fo)" 


It follows that 


lim | [ eevee 
e—0 
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The mapping w +> f ~(A)6(dA) is thus a continuous linear functional on 
L?(dz). This implies the existence of k € L*(dxr) such that 


[vonaar = [ v@r(o\ar.0 


3.2.11 Principal theorem. Let g = (g1,..-,9a) : RN — R? be such that 
GE Di(RN) fori =1,...,d. 

Let oix = (VgilVgox) be the covariance matrix. Suppose that o~' € 
D3(RN). Then the image measure g.v is absolutely continuous with re- 
spect to Lebesgue measure on R!. 


PROOF. By 3.2.9, 


26 ag 
L Zeaan 


< sup |y@)I- \52x()|dv(2). 
€ER4 


Let C = sup, f |6z%(x)|dv(z); then C < +00 and hypothesis 3.2.10(i) is 
satisfied. 0 


Appendix I 
Hilbert Spectral Analysis 


The spectral theorem in finite dimensions makes it possible to write a 
Hilbert space as a direct sum of eigenspaces of a hermitian endomorphism 
u. If the dimension is infinite, direct sums are replaced by “continuous 
sums”. We will apply Bochner’s theorem to obtain the spectral theorem by 
Fourier analysis. 


1 Functions of Positive Type 


Let f be a function defined on an abelian group G. f is said to be of positive 
type if, for any given gi,...,gn € G, the matrix 


(f(g —95)), L<tg<n, 
is positive hermitian. That is, 


So AAS (Gj — 9k) 20, VA1,---,An EC. 
ik 


In particular, taking a single element, we find that the matrix 


( Fa) #10) ) 


is positive hermitian. That is, 


11. f(0) >0, F(9) = f(-g), and | f(g)? < f(0). 
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Let P(G) denote the set of functions of positive type on G. Observe that 
P(G) is a cone: 
Af+phe P(G) Vf,he P(G) and A,wE R*. 
1.2 Proposition. Let [ be an abelian group and let [ be its dual. Then 
(ML(D))* c P(P). 
PROOF. Let p € M{(L), A; € C. Then, writing (7,4) for 7(7), 
00) = SAGs — FAK = Of 5s FA Reda. 
QT 


But 


2 


So Re — Fe) = Dw GW) = [Dar ww) 


whence 
q(A) = { Sov) 
r 


Algebra structure of the cone of functions of positive type 


2 
dut(y) = 0. 


Proposition. Let f and h be functions of positive type on the abelian group 
G. Then their product fh is of positive type. 


PrROoF. Set k = fh and let g1,....gn € G be given. We consider the matrix 
k(gi — 93) = F(9i — 95) h(9i — 95) 
and apply the following lemma. 


1.4 Lemma. Let (Aj) and (Bj), 1 < i,j <n, be positive hermitian matrices. 
Let 
Ci=A\Bi, 1<ij<n. 
Then Cj is a positive hermitian matrix. 
PROOF. Let X}, (respectively Y;) be an orthonormal system of eigenvectors of 
A (respectively B), and let ja (respectively yg) be the corresponding eigenvalue. 
Then 
i ix i ivi 
Ay =" aXaXq and By = >> yw¥sYo. 
a B 
Hence ae ae 
C= pa XnYiXaY p. 
a8 
Set Z! X,Y3 and pa.g = Pos. Then 


ag 
a 4 FI 
Ci = 5 Pea bLZo,62 0,8: 


oA Zi.9Za.8 = 15 Za 


the matrix Alan a is positive. C is thus a linear combination, with positive 
coefficients, of positive matrices, and therefore is positive. 


Since 
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2 Bochner’s Theorem 


Bochner’s Theorem. Let Z be the group of integers. A function f on Z is 
of positive type if and only if there exists u € M,(T) such that fa(n) = f(n). 
PROOF. (<) This follows from 1.2. 
(=) Consider 

gr(n) =r!" where r€ 0,1). 


Then P, (n) = g,(n), where P,(@) denotes the Poisson kernel (see III-2.2.1), 
and thus g, € P(Z). By 1.3, k, = fg- € P(Z). Moreover, by 1.1, 


lAr(r)| < [f(O)|r!"". 
Set 
(i) R= kane: 
The right-hand side is an absolutely convergent series and k,(0) € C(T). 


Next, let \p = e7'?? if |p| < N and \, = 0 otherwise. Then, since 
k, € P(Z), 


. 1 - 
(it) (SGN 0) = So ApAgkr(p— 9), WN €Z. 


We now rewrite Gy(@) in a slightly different form by noting that Raha es 
e(9-P)® and summing over p— q = 7: 


Gr@= > (1- st) etmintny 


|n|<2N-1 


Letting N — +00, the absolute convergence of (i) and inequality (ii) show 
that 


(iit) k,(0) > 0. 
A positive linear functional can thus be defined on C(T) by setting 
2n 
- dé 
Iu) = [ u(Oin 05. 
Integrating the series in (i) term by term yields 
Qn dé 
(iv) Wer || = %-(1) = kr (8) 5— = £(0). 
0 “ue 


Moreover, l,.(e’%?) = f(q)r!4!  f(q) as r > 1. 
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Hence, if Q(@) is a trigonometric polynomial, 


(v) lim 1,.(Q) exists. 


ral 


Since the trigonometric polynomials are dense in C(T) (III-2.2.8) and 
the 1, are equicontinuous by (iv), it follows that 


lim l,(u) exists for every u € C(T) 


and defines a positive linear functional, that is a Radon measure pu € 
M,(T). In particular, 


E(n) = Tim L-(e'®) = f(n).0 


3 Spectral Measures for a Unitary Operator 


Let H be a complex Hilbert space, with hermitian inner product (h,|h2) 
and norm (h|h) = |h||?. A linear operator U is called unitary if it is invert- 
ible and U* = U~!. Recall that the adjoint A* of a linear operator A is 
defined by the identity 


(Ahi |h2) = (hy |A*h2). 


Theorem on existence of spectral measures. Let U be a unitary op- 
erator on the Hilbert space H. For a trigonometric polynomial P(@) = 
One lel 


(i) PU Sy Cn 


Given h € H, there exists a unique un © M4(T) such that, for any trigono- 
metric polynomial P, 


(ii) (P(U)h|h) = vs P(B)djin(6). 


Ltn ts called the spectral measure of U relative to h. 


4 Spectral Decomposition Associated with a Unitary Operator 257 


PROOF. To prove the uniqueness of u;,, write (ii) for a trigonometric poly- 
nomial consisting of a single monomial. This gives 


ji(m)=(m), where +(m) = (U™hlh), 


and uniqueness follows from III-2.2.8. To prove existence it suffices, using 
Bochner’s theorem, to prove that y(m) is a function of positive type on Z. 
We must therefore consider the sign of 


T= So ApAg(P - @): 


Pq 


Since U is unitary, U-' = U*, whence >(p — q) = (UPA|U%h). Thus 
ApAqY(P — 4) = (ApUPh|AGU7h) and 


T= So (ApUPh|AgU4h). 
Pd 
But this can be written 


I= (= ApUPh| S> 0%] = 
Pp q 


2 
>0.0 


Soa 


4 Spectral Decomposition Associated 
with a Unitary Operator 


Theorem. Let U be a unitary operator on the Hilbert space H. Let L*° (Br) 
be the algebra of bounded complez-valued functions which are measurable 
with respect to the Borel algebra of T. Then there exists an algebra homo- 
morphism 

®: £°(T) — End(H) 


that associates the operator U with the function e° and preserves conjuga- 
tion. That is, 


(@(f))* = ®(f). 


PROOF. Recall that the scalar product on H can be obtained from the 
norm by the following polarization identity: 


A(ha|h2) = [|ha + hall? — ||hi — hall? + |léha + hall? — llita — hel|?. 
Polarized spectral measures are defined by setting 


AY ho = Hhith2 “S% Hhi—he ae Hihy+he "x Mih,—he:- 
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Thus, for every trigonometric polynomial P, it follows from polarizing 3(ii) 
that 


(i) (P(U)ha|ha) = I P()d%n, no (8). 


Fixing f € £* (Br), we define a sesquilinear functional gs by 


rhe a 10)dny.na(8). 


This integral is well defined since f is a bounded Borel function. We have 
the following upper bound: 


lap(hishe)| < llAall UlAell Wf llee- 


Hence fixing h; gives a conjugate linear functional in he, and this form is 
represented by a scalar product. There exists a bounded linear operator 
®(f) such that 


(ti) (®(f)hi lhe) = [feta Vhy, he EH. 


Moreover, when f, converges to f while remaining bounded, Lebesgue’s 
dominated convergence theorem shows that 


(277) (®(fn)hilh2) > (®(f)halha). 
In order to show that ® is an algebra homomorphism, it suffices, using (iii), 
to check the assertion for trigonometric polynomials. In this case, (ii) and 
(i) show that ®(P) = P(U), and the formula 

©(P; Pz) = ®(P;)®(P2) 


clearly holds. Finally, by the polarization identity, Yn... = Vr;.nr,+ which 
implies that 


(®(f)hilh2) = (hi|®(f)h2). 
Corollary. Let A € Br. Then ®(1,4) is an orthogonal projection and 
(14) ®(1B) = (Lanz). 
Proor. (®(14))* = ®(14) = (1,4) and (®(14))? = €(14) = (14). 
These properties characterize orthogonal projections. 0 


Corollary (Spectral decomposition). Let P(H) denote the set of closed 
vector subspaces of H. Let T(H) be given the structure of an abstract 
Boolean algebra, with products given by intersections and complements by 
orthogonal complements. Then ® defines a homomorphism ~ from the 
Boolean algebra By to T(H) by setting 


(A) = Image of ®(14). 


Moreover, 


U(p(A)) c 9A). 
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5 Spectral Decomposition 
for Several Unitary Operators 


Let U1,...,Un be n pairwise-commuting unitary operators on the same Hilbert 
space H: 
U.K, = UiUrp, L<kl<n. 


With every trigonometric polynomial 
PGinct l= ieee ee 
on T”, we associate the operator 
PU is US -Crinamely ee 


5.1 Theorem on existence of spectral measures. To every h € H there 
corresponds a positive measure tn on T” such that 


(P(U1,...,Un)h|h) = i P(6)du,(8). 


This is proved by generalizing Bochner’s theorem from Z to Z". Theorem 5.1 
leads to the simultaneous spectral decomposition of the operators Uz, 1 <k <n, 
i.e. a representation of £°(Byn) in End(H). 


Appendix II 


Infinitesimal and Integrated Forms 
of the Change-of- Variables Formula 


In this appendix, we give a new proof of Theorem IJ-4.4. The variational 
method used here, coupled with the ideas of Chapter V, yields a proof in 
the setting of Gaussian spaces. 


1 Notation 


Let ys be a Borel measure on Euclidean space R”. Let {T; : t € [0,1]} be 
a family of R"-valued measurable mappings, defined on an open set D of 
R” and with the following properties: 


(i) T; : D> D' CR” is a diffeomorphism. The inverse diffeomor- 
phism is denoted by A. 

(ii) Vz € D the mapping t — T;x is differentiable. The differential 
is denoted by (£7;) (2). 

(iii) Vt € [0,1] the direct image (A;)« under A; of the measure p is 
absolutely continuous with respect to yz. The density is denoted 

d((At)« 

du 

Let f : R" — R? be differentiable. Js(x) denotes the Jacobian of f at the 

point x. 


262 Appendix II. Change-of- Variables Formula 


1.1 Definition. The vector fields 
Zily) = Tr; | (Ary) 
ly Ht ty 


are called velocity fields associated with (7+) +e[0,1)- 


REMARK. y +> Z;(y) defines not only a vector field on D’ but also a differ- 
entiable mapping from D’ to R”. 


1.2 Definition. Let Z; be a vector field defined on D. Z is said to admit 
a divergence with respect to y if there exists a function 6,2 : D — R such 


that 
[| ueorzteyae == | 11099, 200 


for every differentiable function f :R” — R with support contained in D. 


2 Velocity Fields and Densities 
2.1 Theorem. Let Z, be the velocity field associated with T;. Then the 


density Gi(x) = U(At) 4) 


di is given by 


Gil) = Go(2) exp | : oy(Za)(Ta)as ae. du. 
PROOF. 
(i [ tereiaraucey = fF F(Aandute 
Differentiating with respect to ¢ gives 
£ f(A) = Jp Aa) Ady). 


Furthermore, 

(2) J(poar)(y) = Jp (Ary) Ja, (y), 

and hence 

(i £ FAqy) = Jyoay (uta, (uy? Ary) 
dt dt 

Since 


(iv) (T,° At)y = y, 
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we have 
Jr (Ary)Jay = Id, 


whence (J4,y)~! = Jr,(Ary). Differentiating (iv) with respect to t gives 


(v) (52) (Ary) = —Jr, (Aa) Aww 


Substituting into (iii), we find that 


Gf Aw) =~Jyoanty) (FT) (Aa) 


and 


il 


fF flaaddny) =f Pena i.2)(vddutw) 


(vi) 


a f(x)(5,.Z,)(Tex)Gul@)du(sr), 


where the first equality follows from Definition 1.2 and the second from (i). 
Differentiating (i) with respect to t shows that 


£Gu(a) = (6, 24)(Tir)-Gy(x)  prae. 


oO 


2.2 Corollary. Let u = dx be Lebesgue measure on R” and suppose that 
To =Id. Then Vt € [0,1] 


| f(Tx)|det Jp,x|dx = | f(2')dx’', where D' =T,(D). 
D D’ 
PROOF. It suffices to verify the relation 

d 

rr log detJy,x = 6(Z,)(T:z), 


where Zain (Sn) (Ary) 


and 6Z is the divergence of Z with respect to dx. To do this, we use the 
following two lemmas. 
2.3 Lemma. d 

qe Pte) (Y) = (Jz, )(Tiy) 2 Ir. 


PROOF. 


d 
= (Jr, )(y) =. Jar) — J(z,07,) (Y) = (Jz,)(Ty) 2 JIry.O 
dt dt 
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2.4 Lemma. Let (B;) be n x n matrices such that 4B = M,B:, where 
the (M;) are alson x n matrices. Then 4 log |det B,| = trace M;. 


ProorF. Let ®;(t) be the ith column of B;. It follows from 


d 
qoils = 2 (Men Sel); 
that ; 
Fri) = Di MdinPe(- 
k 
Hence 


< det (t), ...,@,(t)| = S7(M,)udet[®, (t), ...,@,(t)].0 


u 


CONCLUSION OF THE PROOF OF COROLLARY 2.2. 
6(Z:)(T;) = trace(Jz,)(Tia).0 


REMARKS. (1) Compare 2.2 with [-4.4.1, the change-of-variables theorem. 
(2) Let T, =14+tM, where M is an n x n matrix. Suppose that J +¢M is 
invertible for every t € [0,1]. Then 4 +tM) = M(I+tM)"\(1+tM). 
Letting A denote the exterior product, we can express the determinant of 
A+TIas A(t) = det(I +tM) = SYy_o(traceA* M)t*. By 2.4, 
A’ ~ 
x) = trace M(I+tM)~°. 


Thus 


t 
det (I +¢M) = exp [ trace (M(I+sM)~')ds. 
0 


2.5 Corollary. 
d : 
Flot (LD) = f 6Ze(y)dy. 
dt T,(D) 


PROOF. By 2.2, 
vol (T;(D)) = i, |det Jr,(x)| dx 
D 


and 


< [vol (T;(D))]| = A E log det Jr, )| x [det Jr, (x)|da 


lI 


[ 2a (tex)lae Jr,(x)|dx. 
D 


Applying 2.2 once more proves the assertion. 0 
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3 The n-dimensional Gaussian Space 


exp(-3 Jae 
Let R” be given the measure p = J]j_, aos and let (zly) = 


dL1 Ziyi denote the scalar product of two vectors z, y € R”. 


3.1 Lemma. Let Z be a differentiable vector field on 8 C R”. Then Vx € 6 
6.Z(x) = trace Jz(x) — (Z(2)|z). 


3.2 Theorem. Let (T;):¢j0,1) be the mappings defined in Section 1 and let 
d((At)« 
A, (rz) = ee Then 


Hee) Td Tae - i (Ea(Tua) Te) ae. 


PROOF. This follows from Lemma 3.1 and Theorem 2. 
EXAMPLE. Translations of the Gaussian space. 


For a differentiable mapping h : R” — R”, set Tr = x — A(x) and 
Tix =x —th(x). Let A; be the inverse of T;. Then Ayxz = x + th(A;2). 
The velocity fields associated with T; are 


2(2) = (Sn) (Arn) = =h(Ar2). 


We have 
(Z;(T5x)|Tsx) = —(A(x)|x) + s(h(x)|h(z)) 


and 
ex|- / Eu(Taa) Taz) = exp t(h(2)ie) - Fatepiate)} 


Compare this with the Cameron-Martin theorem (V-1.8.3). In particular, 
ifTx =x —y and Tia = 2 — ty, then Ayz = « + ty, det(J7,(x)) = 1, and 


REMARK. This method can be extended to the infinite-dimensional Gaus- 
sian space. 


Exercises for Chapter I 


Problem I-1. If G is a family of subsets of a set X, we denote by a(G) 
the Boolean algebra generated by G and by o(G) the o-algebra generated 
by G. A partition of X is a family P = {P;}j;e, of nonempty subsets of X 
such that P,P; =@Qifi#~j and Ujes = X. 
(1) Let P = {Pj}je7 be a partition of X. Characterize 

(a) a(P) if J is finite, 

(b) a(P) if J is infinite, 

(c) o(P) if J is finite or countable, and 

(d) o(P) if J is uncountably infinite. 
(2) Show that the family A of subsets of X is a Boolean algebra generated 
by a finite number of elements if and only if there exists a partition P = 
{P3}jes, with J finite, such that A = a(P). 
(3) Let A be a o-algebra on a countable set X. Show that there exists a 
partition P of X such that A =o(P). 
(4) Show that a o-algebra never has a countable number of elements. 


Problem I-2. Let G be a family of subsets of a set X such that X € G and 
G is closed under finite intersections. An r-family is a family R of subsets 
of X which is closed under finite intersections of pairwise disjoint sets and 
such that, if B; and By € R with By, C Bo, then By \ By € R. Let r(G) 
be the smallest r-family containing G. Show that r(G) equals the Boolean 
algebra a(G) generated by G. 
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METHOD. Consider the families 


Ri {B:Ber(G) and ANBer(G) VAEG} and 
Re {B:Ber(G) and ANBer(G) VAeEr(G)}, 


and show that they are r-families. 


Problem I-3. Let G, and G2 be two nonempty families of subsets of a set 
X which are closed under finite intersections. Let A,, Az, and A denote 
the o-algebras generated by G1, Go, and G; U Go, respectively. Let P bea 
measure of total mass 1 on (X,.A). Show that if 


P(A, ial Ag) = P(A,)P(A2) for all Ay €G, and Ag € Ga, 


then the same equality holds for all Ay; € A; and Ag € Ag. 
METHOD. Consider the families 


M, = {A:A€A and P(AM Ag) = P(A)P(Az) VAo € Go} and 
My {A: AEA and P(A,M A) = P(A;)P(A) VAi € Aj}, 


and apply the theorem on monotone classes, using Problem I-2. 


REMARKS. 1. This result is especially useful in probability theory. Thus, if 
X =R’, Ay(x) = {(@1, 22): v1 < x}, and Ao(y) = {(21, 242): v2 < y}, 
then G; = {A;(z) : «x € R} and Gg = {Aa(y): y © R} are closed 
under finite intersections and A is the set of Borel subsets of R?. If P is a 
probability measure on (R?, A), it is the distribution of a pair (X1, X2) of 
real random variables. By Problem II-3, (X 1, X2) is a pair of independent 
random variables if and only if 


PIX, <2; X_< yl = P[X <a] P[X2 < y| 


for all (x,y) € R®. 
2. The result can be extended from two factors to n factors by constructing 
monotone classes M; for k = 1,2,...,n and using induction on k. 


Problem I-4. Let ¢ = {z,}?2 9 and let 
CS {2 :In ER Vn EN and |zl|. =sup|zn| < oo} 
n 


Define T : © — €° by (Tx)o = Xo and (T2)n = Fn — Ln—1 if n > 0. 

(1) Ife = (1,1,...,1,...), show that the equation Tx = e has no solution 
x in £%. 

(2) Let F = Té~ be the image of T. Assume without proof that there exists 
a continuous linear functional f on @° such that f(x) = 0 for every x in 
F, f(e) = 1, and sup{|f(z)| : ]z]]o. < 1} < +00 (Hahn-Banach theorem). 


Show that if x = {x,,}°2, is such that x, > 0 for every n, then f(x) > 0. 


(3) Let S : £° — &€* be defined by (Sv)p = 2n41 if n > 0. Show that 
f(x) = f(Sz) for every x in £~. 
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(4) Show that liminf,.4..2n > 0 implies that f(z) > 0. Conclude that 
liminfp4ootn < f(x) < limsup,_..., En for every x € £°. 

(5) Let A Cc N and let 14 € © be defined by 14(n) = 0 ifn # A and 
la(n) =1lifne A. If P(A) = f(14), show that P(AU B) = P(A)+ P(B) 
if AN B =@ and that P does not satisfy the countable additivity axiom. 


REMARKS. The linear functional f above is called a Banach limit; it cannot 
be written down explicitly since it is constructed by means of the Hahn- 
Banach theorem and the axiom of choice. Similarly, it is impossible to 
give an explicit example of an additive but not o-additive measure on a 
o-algebra. 


Problem I-5. Let X be an uncountable set and let A be the o-algebra 
generated by the family of 1-element subsets of X. (See Problem 1, question 
(1d).) Let P : A — [0,1] be defined by 


P(A)=0 if A is finite or countable 
P(A) =1 if A is cocountable. 


(A is cocountable if A‘ is finite or countable.) Show that P is a probability 
measure on (X, A). 


Problem I-6. Let (X, A, jz) be a measure space and let f be a nonnegative 
measurable function on X. For every t > 0, set 


F(t) =p{a: f(z) >t} and G(t)=y{x: f(x) > th. 
(1) Assume that f(X) C N and that f is integrable. Prove that 


I f(x)du(x) = S> F(n) = 32 Gn). 
n=0 n=1 


METHOD. Set fn = p{x : f(x) = n} and show that fy f(x)u(dx) = 


dina Hn- 
(2) Assume that f® is integrable for a > 0. Prove that 


+00 +00 
a a2 a-l = a-l 
: FOr Cee i 1° F(t)dt =a [ t°-1G(t)dt. 


METHOD. Show that (2) holds for a = 1 by considering the functions 
fn(x) = 2 i@ where [a] means “the greatest integer < a”, and using the 
monotone convergence theorem. The general case can then be reduced to 
the case a = 1. 


Problem I-7. If 0 < r < 1, we write the Poisson kernel as 


1—r? 


lee) 
P.(@) =14+2 0 Oise ee, 
(8) 7 Bi ar 1 — 2r cos 6 + r? 


n=1 
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(1) Show that r? + cos @(1 — 2r) > 0 if 0 <@< 7m and 3 <r< 1. Deduce 

that @?P,.(@) < fee and evaluate lim, [7 0?P,()d0. 

(2) Show that [> 6°P,(0)d0 = ~ +40, 4) and use this to derive 

another expression for lim,_,, cs 6? P,.(0)d0. 

(3) Use (1) and (2) to find the sums of the series 37 ~_, cy et ee 

oo 

and oy ro 

(4) Express f, (log(1 — x))?& as the sum of a double series and show that 
1 0° 

Jo Qog( ~ 2)? Se = 2000 ae 


(-prt 


dx 


Problem I-8. Evaluate S>~ ae 


n=1 n 
the monotone convergence theorem. 


by using the integral Hie and 


Problem I-9. Let (X,A,j) be a measure space and let x + f(x) = 
(filz), fo(x),...,fn(v)) be a measurable mapping from X to R”. Suppose 
that R” is equipped with a norm || || such that x + || f(x)|] is integrable. 

(1) Show that f; is integrable for every j = 1,2.....n. 

(2) Defining fy f(x)du(x) in R” by 


show that || fy f(x)u(de)l| < fy If (@)lla(de). 


METHOD. On the dual space (R")* consisting of linear functionals a: vu i> 


(a,v) on R"”, introduce the dual norm |ja||* = sup, zo ber and use the 


l(a,v)| 


llall* * 


fact that |v] = sup, zo 


REMARKS. 1.The shortest path between two points is a straight line. Con- 
sider R” with the Euclidean norm |jv|| = [vp + v3 +--- v2]? Let X = 
[0,1] with Lebesgue measure. (See Chapter II.) Let F be a function from 
[0,1] to R” such that the derivative f = F’ exists everywhere and is con- 
tinuous. Then fo || f(z)\|dx can be interpreted as the Euclidean length of 
the curve described by F, and || is f(x)dz|| = ||F(1) — F(0)|| is the length 
of the line segment with endpoints F'(0) and F'(1). 

2. Case of equality. It can be shown that, when the unit ball B is strictly 
convex (that is, when |/v;|| = ||vol} = ||Avi + (1 — A)vel]| = 1 for A € [0,1] 
holds only for \ = 0 or 1), the inequality is strict unless there exist v € R” 
and a function g(x) > 0 such that f(x) = g(a)v p-almost everywhere. The 
application to the Euclidean length of a curve is immediate. 


Problem I-10. Let X,X,,...,X,.... be measurable functions from a 
space (F,€, 4) to an open set 2 of Euclidean space R? such that 


Ve >O p{||Xn—X|| eh) - 0 asn- ox. 
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(1) Show that Ve > 0 there exists a compact set K C 9 such that u({X ¢ 
K}) <€ and, for every n, u({Xn ¢ K}) <e. 
(2) If f : Q > R™ is continuous, then Ve > 0 

HCI F (Xn) — FOI 2 ef) +0 as noo. 


Problem I-11. Let (X,.A,y) and (Y,8,v) be measure spaces such that 
w(X) and v(Y) > 0. Let a: X — C and b: Y > C be functions, respec- 
tively A and B measurable, such that 


a(x) = b(y) 2 ®v-almost everywhere on X x Y. 


Show that there exists a constant \ such that a(x) = X p-a.e. and b(y) = A 
V-a.e. 


Problem J-12. On a measure space (X,A,), let f and g be complex 
functions such that |f|? and |g|? are y-integrable and consider the function 


h(z,y) = |F(x)g9(y) — F(y)g(@)P. 


(1) Show that 0 < fy, h(z,y)du(x) du(y), and use this to prove the 
Cauchy-Schwarz inequality: 


2 
< ie LF (0) Pdp(c) i, la()|2du(c). 


/ fla)g(@)dy(e) 
xX 


METHOD. Consider first the case where f > 0 and g > 0. 

(2) Show that equality holds in Schwarz’s inequality if and only if either 
g(x) = 0 pra.e. on X or there exists a constant A € C such that f(x) — 
Ag(x) = 0 p-a.e. on X. 


METHOD. Problem J-11 can be used. 


Problem I-13. If X and Y are measurable real-valued functions defined 
on the measure space (2, A, 44) such that u({Y < x < X}) =0 for all real 
x, show that w({Y < X}) =0. 


Problem I-14. Let (X,.A,) be a measure space, where (X) is not nec- 
essarily finite, let (Y,B) be a measurable space, and let f be a measurable 
mapping from X to Y. Suppose that there exists a sequence {B,,} in B 
such that U%_,B, = y and u(f~!(Bn)) < co for every n. 

(1) Show that v(B) = «(f~'(B)) defines a measure v on (Y, B) (called the 
image of yz under f). 

(2) Show that if g € L1(v), then 


/ aC f(a))u(de) = i a(u)u(dy). 
xX ¥ 


REMARKS. 1. The image measure always exists when py is bounded; this is 
used extensively in probability theory, in Chapter IV. It does not always 
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exist if 4(X) = +00. For example, if X = R? is equipped with Lebesgue 
measure pp = dx dy and f : R* ~ R =Y¥ is the projection f(x,y) = x, the 
image of yz does not exist. 

2. If X and Y are metrizable locally compact spaces which are countable at 
infinity and 4 is a Radon measure on X, a sufficient condition for existence 
of the image measure is that, for every compact set K in Y, f~!(K) should 
be relatively compact. See problems II-11, 12, and 13 and III-3. 


Problem I-15. (1) Let f be square integrable on [0,1] and let F(x) = 
i f(t)dt. Applying the Cauchy-Schwarz inequality to the product f x 1 
on (0, x], show that lim, ,) a~!/2F (x) = 0. 

(2) Let g be square integrable on [0,+00) and let G(x) = fy g(t)dt. Ap- 
plying the Cauchy-Schwarz inequality to the product g x 1 on [a,x], with 
a sufficiently large, show that lim,.45. v7 !/?G(x) = 0. 


REMARK. It is easy to replace L? by L’, with p > 1. If 2 + ; = 1, we find 
that 2—!/4F (x) + 0 as x > 0 and x7 !/4G(x) — 0 as x > +00. 
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Problem II-1. Let J be an open interval in R, equipped with the Borel 
algebra B. A function F : I > R is called increasing if x < y implies that 
F(a) < F(y). We set F(x — 0) = limysz F(y), F(a +0) = limy), F(y), and 
Dr ={x:F(x-—0) 4 F(x +0)}. 

(1) If F : I — R is increasing, prove that Dr is finite or countable. 


METHOD. If [a,b] C I, show that D(n;[a,6]) = {xz € [a,b] : F(2# +0) — 
F(a -—0)> +} has a finite number of elements. 


(2) If F : I — R is increasing, prove that there exists exactly one measure 
Lt > 0 on (J,B) such that 


F(y) — F(z) = p({[z, y}) 


for all x, y such that [x,y] C I and 2, y ¢ Dr. 
Prove that u({a}) = F(a +0) — F(a — 0) for every a in J. 


METHOD. Uniqueness: Use the fact (II-3.2) that a Borel measure that is 
locally finite on an interval is regular, and hence determined by its values 
on open sets. 

Existence: Imitate the construction of the Riemann integral. For every con- 
tinuous function f with support contained in J, define the integral f fdu 
as the limit of integrals of step functions 


do 9(ai)(F (a) = Bisa): 
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(3) Let yz be a locally finite nonnegative measure on (J, B) and let x9 € I. 
Set F(x) = pe((vo.x)) if e > xo and F(x) = —p((x,29)) if c < xp. Show 
that F’ is increasing and that F(y) — F(x) = p([z,y]) ify ¢ De. 

(4) Let a relation on the set of increasing functions on J be defined as 
follows: F\ ~ F if there exists a finite or countable subset D2 of J such 
that Fi(y) — Fi(a) = Fo(y) — F(x) for all x and y € I\ Dy». Show that 
this defines an equivalence relation on the set of increasing functions on J. 
Characterize the equivalence classes in terms of measure. 


REMARKS. 1. Since perhaps as many as 90 per cent of the measures used 
in practice are measures on R, a description of all the Radon measures > 0 
on an open interval is important. Historically, the first measures > 0 were 
considered by Stieltjes, precisely by means of increasing functions. 

2. With every increasing function F’ on an open interval J, we can thus 
associate a measure p(dx), which is often written dF(x) or F (dx). Con- 
versely, given a measure jt > 0 on J, an increasing function F satisfying the 
hypotheses of part (2) is called a distribution function for yu. As we have 
seen. a distribution function for jz is not unique; we can modify (slightly) 
its value at points of discontinuity (the atoms of j) and add an arbitrary 
constant. When y is a probability measure on R, there are three traditional 
choices for distribution functions: 


Fy (2) = w((—o¢, 2), P(e) = w((~00,2]), and F3(x) = slFile )+ F(x). 


The third appears in the inversion formula for a characteristic function. 
3. If we consider a measure ys > 0 on a closed interval of the form (—oo, 6], 
[a, +00), or [a, 6], we can define its distribution function as above. However, 
two measures can then have the same distribution function but different 
masses at the endpoints of the interval. 

4, Many identities and inequalities use increasing functions on an interval. It 
is essential to express the latter in terms of measures in order to understand 
the former; this also gives a systematic method of proof, although not 
necessarily the shortest. 


Problem II-2. Specify for which measure on the open interval I each of 
the following increasing functions is the distribution function (see Problem 
II-1). 


()T=R 

(a) F(a) =a (b) F(x) = [a] (c) F(z) = + arctan x 
Qt Shed 

(a) F(z) =tan % (b) F(x) = (signz)|a|'/?_ (c) F(x) = 4arcsin x 
(3) I = (0,+00) 

(a) F(x) = log (b) F(z) = -[3] (c) F(z) = (x-1)* 


(Notation: [a] = sup{n:n € Z and n < a}, at = sup{0, a}, and sign a = 
+1 if a > 0, sign 0 = 0, and sign a = —-1 if a < 0.) 
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Problem II-3. Let J be an open interval in R. A function G is called 
convex if its right derivative lim.;o9[G(x + €) — G(z)] = G‘.(x) exists for 
every x in J and the function z + G'‘, (x) is increasing. (See I-9.2.1.) 

Prove that G is convex if and only if there exists an increasing function 
F on TI such that, for every x in J, 


G(x) — G(xo) = 1 F(t)dt. 
xo 
METHOD. For one direction, show that G!,(x) = lim,,9 F(x + €). For the 
other, consider H(x) = Te. G',(x)dt and use without proof the fact that, 
if a function has a right derivative that is zero in an open interval J, it is 
constant in J. 


REMARK. It can be shown that the definition of convex functions given 
here is equivalent to the following property: 


G[(Av + (1 —A)y] < AG(x#) + (1 -—A)NG(y) ifa, y Ee J and A € [0, 1). 


For a proof of this equivalence and further details of convex functions, the 
reader may consult Artin! or Zygmund?. 


Problem II-4. Let J be an open interval in R. Recall (see Problem II-3) 
that a function G : I — R is called convex if there exists an increasing 
function F' on I such that, for every zo in J, 


6a Gane / ” P(t)dt. 


If yz is the measure on J given by the distribution function F' (see Problem 
II-2), prove the following assertions. 
(1) If zo < x, with x and xo € I, then 


Gla) - Gao) = (@=20)F (ao +0) + i, Ley 2) (t)(2 — u)u(du) 


(a — xo) F (ap — 0) + [reoailte — u)p(du). 


(2) If xo > x, with xq and x € I, then 


G(x) - G00) = (e~20)F (eo + 0) - i Le,co) (tt) ce — 1) gud) 


(a — 0) F (0 — 0) - i Vreven) (ts) (a — 18) (dt). 


'B. Artin, The Gamma Function (New York: Holt, Rinehart and Winston 
1964), 1-6. 

2A. Zygmund, Trigonometric Series (Cambridge: Cambridge University Press 
1959), 21-26. 
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REMARKS. If uw has no atoms and xp < x, we can replace the notation 
iF Liag,2](u)g(ujdu = iF Ljxo.2](u)g(u)du by es g(u)u(du), since the lat- 
ter is unambiguous in this case. If « < 2x9, we write ce g(u)u(du) = 
—J; phe oo] (u)u(du), which permits us to state the relation of Chasles: 
= =f? + ie for arbitrary a, 6, and c in J. However, this relation does 
not hold if 2 has atoms. 


Problem II-5. Let A, be the set of measures ps > 0 on (0, +co) equipped 
with its Borel algebra, such that iG 1(z,400)(u)up(du) < 00 for every x > 0. 
(1) Let G be a convex function on (0, +00) (see Problem II-4) such that 
lim; +. G(r) = 0. Prove that there exists a unique pz in M, such that 


+00 
(i) G(x) = [ (u—2x)*p(du) for every x > 0, 


where at = max(0,a), and that ie up(du) = limz.o G(z) < +00. 

(2) Conversely, let 4 € MM). Show that (i) defines a convex function G on 
(0, +00) such that lim;+4.. G(x) = 0. 

METHOD. Let F(x) be as in Problem II-4 and show that F(z) < 0 and 
that lim, 4. 2F (x) = 0. Then use Problem II-4. 


REMARK. The measure zy(dx) is not necessarily bounded: G(x) = + gives 
u(dx) = 3 

Problem II-6. Let M be the set of measures v > 0 on (0,-+00) equipped 
with its Borel algebra, such that v([z,+00)) < +00 for every z > 0. If 
k is a positive integer, we denote by C; the set of functions g defined on 
(0, +00) such that G(x) = (—1)*~1!g*-)) (x) exists and is convex and also 
that limg4.6 g(x) = limg_.4.. G(x) = 0. 

(1) If g € Cy, show that there exists a unique v in M such that 


(2) g(xz) = a (1 - =)" ; v(du) for every x > 0. 


(2) Conversely, let v € M. Show that (i) defines an element of Cy. 


METHOD. (1) First use Taylor’s formula to show that limy 4. 94) (x) = 0 
for j = 0,1,...,4—1, then use Problem IJ-5. 


REMARK. It is clear that the functions f,(x) = [(1 — £)*] : play the 1ole of 
extremals in C;; formula (i) shows that the functions in C;, are “barycen- 
ters” of the f,. Formula (i) plays a role in the probability distributions of 
Polya and Askey. (See Problem III-5.) 


Problem II-7. Let wu be a decreasing function defined on (0,+00) such 
that u > 0 as 2 — +00 and {>° x?u(x)drz < oo. Show that, for every 
y > 0, 


+00 4 +00 
vf u(x)da < =f xu(x)dx (K.F. Gauss). 
y ) 
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Describe in detail the case of equality. 


METHOD. Consider a measure pu on (0, +00) for which —u is a distribution 
function. 


Problem II-8. Let u be a decreasing function defined on (—a, +00), with 
a > 0, such that u > 0 as > +00 and f"”° u(x)dx < +00. Show that 


y y Foo 
| u(a)dz < 1 f u(z)dx Vy > 0, 
0 yra —a 


and describe in detail the case of equality. 


METHOD. Consider a measure ps on (—a, +00) for which ~u is a distribution 
function. 


Problem II-9. Let F be an increasing function on [a,b] and let f be an 
integrable function on [a, b]. Show that there exists a number € in [{a, b] such 
that 


b é b 
z)F(x)dx = F(a z)dx+F(b x)dz. 
[ fore) (o) [Flan + PO f Fe) 


a 


(Second mean value theorem for integrals) 


METHOD. Show that this can be reduced to the case where F(a) = 0 


and F(b) = 1, and consider a probability measure yz on [a, 6] such that 
F(x) = p((a,2]) fora é Dr ={x:a<u<b and F(x —-0) < F(x+0)}. 


Problem I-10. Let jz be a probability measure on [0, 1]. Set m = ie cp(da) 
and 0? = fe x? (dx) — m?. Show that o? < }. Describe in detail the case 
of equality. 

Problem II-11. Let f be a positive decreasing function on (0, 1] such that 
Jo f(x)dx = 1, and let d € [0,1]. Let P(dx) = Abp(dx) + (1 — A) f(a)dz, 
where 6g is the Dirac measure at the origin, let m(A, f) = fe zP(dzx), and 
let 02(A, f) = fy, «?P(dx) — m?(A, f). 

(1) Show that o7(A, f) < 1/9. Describe in detail the case of equality. 

(2) Show that 0?(0, f) < 1/9. Is this inequality the best possible? 


METHOD. If Dy is the set of points of discontinuity of f in (0, 1], consider 
the measure vy on (0, 1] such that f(a) = v([x,1]) if ¢ ¢ Dy and show that 
(dt) = tv(dt) is a probability measure on (0, 1]. 


REMARK. If G is a convex function from (0, 1) to [0, 1), it can be shown that 
the measure P on [0,1) which is the image under G of Lebesgue measure 
on (0,1) is of the type considered in the problem. Hence 


2 


[ G?(a)dx = if G(a)az| < . 
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Problem II-12. Let n be a positive integer and let @, @1,...,@n, C1,---.Cn 
be real numbers such that a, < az <... <a, and ¢; > 0 for 7 =1,...,n. 
Let C and R denote the complex and the real numbers completed by a 
point at infinity oo. Consider the function f : © - C defined by f(x) = 00 
if x € {00,a1,...,an} and 


f(v)=rta-> “I if x € {oo,ay,..., an}. 


vo — ay 


The function T : R — R is the restriction of f to R. Lebesgue measure on 
R is the measure m such that m({oo}) = 0 and the restriction of m to R 
is the usual measure. 

(1) Let y € R. Show that the equation in x given by f(x) = y has exactly 
n+ 1 real roots {x;(y)}F29 such that a; < xj(y) < aj41 (with the con- 
vention that a9 = —0o and an41 = +00). Show that 079 24(y) = 1 and 
conclude that T preserves m. That is, for every F in L'(m), 


| F(T (x m(de) =  F(2)m(dz), 
[Fee )ym(ae) = f Fle)m(ae) 
(2) Prove by induction on the integer k > 0 that, for every z € C, 
nm k 
~k-1loy 1 0 = 
Siew -a aw =5(g) w-rer. 


jJ=0 


(3) Let g be a nonnegative rational function such that SR g(x)m(dz) < oo. 
Prove that there exists a rational function g; with the same properties and 
such that the image g(2)m(dz) under T is gi(x)m(dx). Conclude from (2) 
that, if z; is a pole of g; with multiplicity m, > 0, there exists a pole z of 
g with multiplicity m such that f(z) = 21 and m, < m. Calculate g; when 


(4) Let 2 = a+ib € C, with b > 0. The Cauchy measure y, on R is defined 
by 4. (dr) = qe: Prove, using (3), that the image of y, under T is 
Vf (2): 

REMARKS. 1. A Cayley function is a function of the form 


n 


Cj 
x) =cr+a- ; 
fa)=oes0 
j=l 
where c; > 0,7 =0,1,...,n anda, aj,...,@p are real. Ifcg =O andn=1, 


axz+b 


naar with 


it is a positive linear fractional transformation; that is, f(a) = 
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a, b, c, and d real and ad— bc > 0. It is easy to see that all Cayley functions 
can be obtained by composing positive linear fractional transformations 
with the Cayley functions corresponding to co = 1. 

2. It is easy to see that if f is a positive linear fractional transformation 
and T is its restriction to R, then the image of y, under T is yz). This 
observation, the remark above, and result (4) of the problem show that the 
property holds for all Cayley functions. 

3. Conversely, let T’: R — R be a rational function such that, for every z 
with positive imaginary part, the image of y, under T is a Cauchy distribu- 
tion yz, (where z; depends on z). It can be proved that T is the restriction 
to the real axis of a Cayley function. 

4, On the other hand, a Cayley function with co > 0 maps Lebesgue mea- 
sure m to com. If co = 0, the image measure is no longer a Radon measure 


on R. For example, f(x) = —4+ maps m(dr) to min) 


Problem IJ-13. The half-plane RZ. = {(z,y): 2 €R and y > O} is 
equipped with the measure p(dz,dy) = fy. What is the image v on 
[1, +00) of this measure under the mapping (x,y) + v(z,y) = ay (t +27 + 
y”) (in the sense of Problem I-14)? 

Problem II-14. Let {un}n>o be a sequence of positive measures on R, 


each with total mass < 1. Suppose that yu, converges weakly to flo as 
n — oo and that 
+00 
M= sup | x? Un(dz) < 00. 
n —oo 


(1) Show that yu, converges narrowly to jig as n > oo. 


+00 +00 
(2) Show that i, |a|Un(dx) i |x|uo(dx) as n > oo. 
—oo —0o are 
(3) Show by a counterexample that i 2° Un(dx) does not necessarily 


+00 oP 
tend to / x W9(dz). 


METHOD. Use Theorem II-6.8. 


Problem II-15. If g is a measurable function on (0, +00) which is locally 
integrable, and if A = limpoic He g(x)dz and B = lim, 49 f g(a)dx 
exist, we say that Jere g(x)dx exists and equals A + B. 

Let f be measurable and locally integrable on (0, +oo) and suppose that 
limyp_+00 ie f(z) # exists. Let a and 6 be positive. 
(1) Suppose that K = f.~ f(x) exists and let F be defined by F(x) = 
Ji f(ddt. Show that f>°[F (ax) — F(bx)]} exists and express the integral 
in terms of a, b, and K. 
(2) Suppose that L = lim..o f(x) exists. Show that fo°(f(ax) — f(bx)) 
exists and express the integral in terms of a, b, and L. 
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Problem II-16. Writing 2~! = ti e ¥*dy for x > 0 and applying Fubini’s 
theorem, show that the integral {,~ sinv“ exists (in the sense of Prob- 
lem JI-15) and compute it. Use this to salute the integrals Jo (cos ax — 


cos bx) 43 and i cos ax — cos bx) 2 if a, b > 0. (See Problem I-15.) 


Problem II-17. For an interval J in R, L?(I) denotes the set of real- 
valued functions (rather, equivalence classes of functions) whose pth power 
is integrable with respect to Lebesgue measure on I. 
(1) Show that L?’((0,1]) c L®([0,1]) if 0 < p < p! < ov. Give an example 
of a function in £1((0, 1}) \ £({0, 1]). 
(2) Give examples of functions in L1(R) \ L?(R) and in L?(R) \ L'(R). 
(3) € is the set of real-valued sequences a = {an }n>o0 such that > |a,|? < 
oo. Show that ¢?'(N) > £?(N) if 0 < p < p’ < oo. Give an example of a 
sequence in ¢? \ @1(N). 
Problem II-18. Let Hes denote the set of pairs (a,p) with p > 0 and 
a € R”. Euclidean space R” is equipped with the scalar product (a,t) and 
the norm |la||. Let 

K(a,p) = Kyp[llall? +p)", 
where K,, is the constant such that f,, K(2,1)dx = 1. The goal of this 
problem is to calculate 


Ii(a,p) = | expi(x,t) K(x —a,p)dz, 
R” 


where t € R”. 
If f: Rit! = C, we write Dof = gf and Djf = x2-f for j=1,....n 


is said to be harmonic in R”*! if 
f 
(Dé +---+ D2)f(a,p)=0 for every (a,p) € RY. 


(1) Show that K is harmonic in R"*+’. Show that, if pp > 0 and V = 
Bo 3P0) there exists a constant C such that |D;K(a,p)| and |D;D;K(a,p 
2° 2 


are less than C(1+|lal|2)~ “2 for all (a,p) € R" x V and i, j =0,1,. 
(2) Let ys be a Radon measure on R” such that 


[04 beter |(dr) < 2 


and let F,,(a,p) = JR» K(a — a,p)u(dr). Show that F, is harmonic and 
that ines F,,(a,p) = 0. 

(3) Show that there exists a function g : R™ — C such that I;(a,p) = 
g(pt) exp(i(a, t)). 

Use (2) to calculate g. 
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REMARKS. 1. In n dimensions, K(x — a, p) is sometimes called the Poisson 
kernel; in R”, it is sometimes called the Cauchy distribution. 

2. The calculation giving K, =T(24+)m-(@+)/? js carried out in Problem 
Iil-4. 


Problem II-19. (1) Let 4: and v be positive measures on R such that there 
exists an interval [a,b] C R with p((a,b]) = w(R) and v([a,b]) = v(R). 
Show that yw = v if and only if 


[etwac) = [ z"v(dz), YWn=0,1,2,.... 
R R 


(2) Let ys be a positive measure on [0, +00) (not necessarily bounded). Its 
Laplace transform is the function from R_ to [0, +00] defined by 


ule) = [ * e-* u(dz). 


(a) If By, = {s : (Lyu)(s) < oo}, show that EH, is an interval which, if 
nonempty, is unbounded on the right. Give examples where EL, = R, 9, 
(0, +00), and [0, +00). 

(b) Use (1) to show that if there exists a number a such that Lu = Lv < 
+oo on [a,+oo), then wp =v. 


Problem II-20. Give examples of sequences {jin}, of positive Radon 
measures on R such that there exists a positive Radon measure with 
limp —oo Un =U 

(1) vaguely but not weakly; 

(2) weakly but not narrowly; and 

(3) narrowly but not in norm. 


REMARK. If the sequence of positive measures {1 }°°, converges vaguely 
to wand u(X) < co, then pu, — py weakly, since Cx (X) is dense in Co(X). 
It should also be noted that narrow and weak convergence coincide when 
X is complete. 


Problem I]-21. Let X be a locally compact space which is countable 
at infinity and let M1(X) be the set of signed Radon measures v on X 
such that |v| has finite total mass ||v||. If {v,}2@, is a sequence in M1(X) 
such that r = sup,, ||vn|| < oo, show that there exist v in M1(X) and an 
increasing sequence of integers {nx }?2, such that vz, — v as k — oo. Show 
also that v > 0 if vy, > 0 for every n. 


METHOD. Use Theorem II-6.6. 


REMARK. When X = R, vy, > 0, and r = 1, this property is often called 
Helly’s theorem. 


Problem II-22. On a locally compact space X which is countable at infin- 
ity, let w and {u,}°2, be positive Radon measures such that jun converges 
vaguely to “4 as n — ov. 
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(1) If O is an arbitrary open set, show that u(O) < liminf,.. fn(O). 

(2) Suppose that O is an open set with compact closure K and such that 
its boundary 0O = K \ O has p-measure 0. Let {Ox }?2., be a decreasing 
sequence of open subsets of X such that N22 ,O; = K. Let fy be a function 
equal to 1 on K& and to 0 on Of and satisfying 0 < f(x) < 1 for x in Ox. 
(Such a function exists by Urysohn’s lemma, II-1.1.) Show that 


limsup pin(O) Sf fel)q(de. 
no x 
and conclude that j4,(O) - p(O) as n > ow. 
(3) If p and {j1,}°2, are Radon measures on R, positive and with total 
mass less than or equal to 1, show that , converges weakly to pz as n — 00 
if and only if 

Hin ((a,b)) > w((a,6)) asin — 00 


for all points of continuity of the distribution function « H p((—co, £)). 
If, moreover, pin(R) = u(R) = 1, show that uw, — pw narrowly if and only 
if 
Jin ((—00, 2) + p((—00,2)) as n+ 00 
for every point of continuity of the right-hand side. 
METHOD. Use Problems IJ-1 and I-21 together with Theorem II-6.8. 


REMARK. In practice, (3) gives a necessary and sufficient condition for 
the convergence of probability distributions on R; it is often taken as a 
definition in elementary texts. 


Problem II-23. Let X be a locally compact space which is countable at 
infinity, and let w and {,}° , be Radon measures on X such that pp, 
converges vaguely to w. 

(1) If O is an open set in X and y* is the restriction of ps to O, show that 
1% converges vaguely to p* as n — ov. 

(2) Show by an example that the statement is false if O is replaced by a 
closed set. 

(3) Suppose that X = R and that uw, > 0, n = 1,2,.... Let a and 6 
be real numbers with a < b. Show that there exist numbers p and q and 
an increasing sequence of integers {n,.}72, such that, for every continuous 
function f on [a, dj, 


fing > pf(a) + af (0) + if Ps asene Sia 
[a,b] [2.5] 
METHOD. Use Problem II-21. 


Problem II-24. (1) Let O and O’ be two open sets in R”, let f be a 
diffeomorphism from O onto O’, and let y be a measurable function on O! 
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such that Sor Of "\dx' < co. Show that 


] elfle)MdetJy(a)lde =f o(a'yae' 
(6) ou, 


where |det J+ (x)| is the Jacobian. 

(2) Let a € RU {-oo}. Let f and g be functions satisfying the following 
conditions: (i) f is continuously differentiable for x > a; (ii) g is defined 
and integrable on [0,+00); (iii) |f/(@ + “| < g(u) for all x > a; and 
(iv) both u > ug(u u) and ur> f(a + =) are integrable on [0, +00). If 
F(z) = ee flat > *\du, show by a change of variables in polar coordinates 


that 
1 +00 2 


f(2)=-5 F'(a+ —)dv. 


REMARK. The case f(x) = e~* is well known and is used in IV-4.3.2(i). 


Problem II-25. Consider a subset X of R” with positive measure, a 
measurable function f : X — R”, and a nonnegative locally integrable 
function h on X. Let ys denote the image in R” of the measure h(x)dx on 
X under f (in the sense of Problem I-14) if this image measure exists. 
(1) If X and U are open sets and f is a diffeomorphism from X to U, show 
that 

u(du) = A(f~*(u))|detJ ps (u)|du. 


(2) If there exist an open subset U of R” and disjoint open sets X1, 
X9,...,Xq contained in X such that the restriction fj of f to X; is a 
diffeomorphism on U, and if X \ ae , X; has Lebesgue measure zero, 
show that 


(du) = y h(f; *(u))|det J, (u)[1u(u)du 


(3) If X = (0, +00), e(x) = 27 */? exp[—(az + b/z)], A(x,y) = e(x)ec(y), 
and f(x,y) = (u,v), with u = x+y and v = 1/x+1/y, calculate p. 
Conclude from the result that the image of hdrdy under the map (2, y) 
(x+y,1/x +1/y —4/(x + y)) is also a product measure. 


REMARKS. 1. The use of the change-of-variables theorem (II-4.4.1) to cal- 
culate the image of a measure is important in practice, especially in prob- 
ability theory. 

2. Problem H-12 treats a special case of (2) for n = 1. 

3. (3) shows that if X and Y are independent random variables of density 
Kc(x)dz (a distribution called “inverse Gaussian”), then X +Y and 1/X + 
1/Y — 4/(X + Y) are independent. It seems difficult to justify this result 
by Fourier analysis. 
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Problem ITI-1. Let G be the group Og of dx d orthogonal matrices, acting 
on the Euclidean space R?. The scalar product and the norm are denoted 
by (x,t) and ||t||, respectively. Let y be a bounded complex measure on 
R?, with Fourier transform 


Alt) =f explite,t)ulde) (ve R®) 


Prove the equivalence of the following three properties: 

(1) pw is invariant under every element of G. 

(2) There exists y : [0,00) + C such that f(t) = y(|t||) for every t. 

(3) The image v, in R of y under the mapping x ++ (a,x) does not depend 
on a when a ranges over the unit sphere S,_1 of R2. 


REMARK. Naturally, if is real, then f(t) = (—t) implies that ¢ is real. 
But yz > 0 does not imply that y > 0. Thus, if o is the uniform probability 
measure on $2, the unit sphere in R?, a(t) = Ta 

Problem III-2. Let T be a compact space, let G be a compact topological 
group, and let (g,t) +> gt be a continuous map from G x T to T such that 
g > {(g,t) + gt} is a homomorphism from G to the group of bijections 
of T. Finally, suppose that (G,T) is a homogeneous space; that is, for 
every t, and f2 in T there exists g such that gt; = to. Let dg denote the 
unique measure of total mass 1 on G which is invariant under left and right 
multiplication. (We accept without proof the existence and uniqueness of 


dg.) 
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(1) If f is continuous on T, show that t+ f, f(g7't)dg is a constant o[f]. 
Conclude that o[f] defines a probability measure on T which is invariant 
under the action of G. 

(2) If ye is a probability measure on T which is invariant under the action 
of G, show that g > J, f[g~'t]u(dt) is a constant. Integrate with respect 
to dg and conclude that uw =o. 

(3) If (X, A) is an arbitrary measurable space and T is equipped with its 
Borel algebra, let T x X be given the product o-algebra. Suppose that G 
acts on T x X by g(t.x7) = (gt, xv). Show that every positive measure jz: on 
T x X which is invariant under the action of G has the form o(dt) ® v(dz), 
where v is a measure > 0 on (X,.A). Converse? 

Mernuop. If A € A is such that p(T x A) € (0, +00), show that p4(B) = 
Heed 
(4) Apply the preceding results when T = Sy is the unit sphere of the 
Euclidean space R°*!, where G = Oy41 is the group of (d+ 1) x (d+ 
1) orthogonal matrices and X = (0,+00). Conclude that a probability 
measure P on R“*! \ 0 is invariant under G if and only if Tel and ||a|| are 


defines a probability measure on T which is invariant under G. 


independent and Tal has the uniform distribution on Sq. 


Problem III-3. In the Euclidean space R% equipped with the norm |ja||, 
let m be Lebesgue measure. 
(1) If vp and 1; are the images of m in [0, +00) under the mappings z + ||x|| 


and © el (see Problem I-14), show that 


where I is the usual Euler function (see, for example, Problem IV-11). Use 


this to find vo(dp). 
Sa Lol 
——- exp( — =k 
ot ( 274 Rd Dp 


which holds for all ¢ > 0, to calculate the Laplace transform (Lv;)(s) 
defined in Problem I-19. 

(2) Keep the same notation m and v for the restrictions of m and vp to 
R‘\ {0} and (0, +00). If is a measure > 0 on R‘\ {0} which has density 
f with respect to m, use Problem III-2 to show that the image of yz on 
(0,-+oo) under the map +> ||2|| is of the form f(p)vo(dp) and calculate 
the function f; in terms of f. If yw is rotation invariant, show that there 
exists a function f, : (0,00) — [0, +00) such that fi(||z||) = f(a) m-a.e. 


METHOD. Use the formula 


I|z||? 
20? 


Problem III-4. Euclidean space R? is equipped with the scalar product 
(x,t) and the norm ||¢||. P is the usual Euler function. 
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d 
(1) Use Problem IJJ-3 to evaluate J = | —__“* ____ Ifa and t are in 


Ré (1+ |le|?)" 
R? and p > 0, use Problem I-18 to conclude that 


d+1 
i) gitest) pdx = ew Plltll+i(a.t) 
d+ 2 2) tht : 
me JR¢ (p? + ||z — all?) = 


(2) Show that, if 2 € R? and p > 0, 


pitty os =i on llell-+ite.t) ay 
2 (p? + |[x||?)"F Je 


Problem III-5. Let k be a positive integer. In the Euclidean space R7*~!, 
the norm is written ||t|| and the scalar product (x,t). Consider the map 
y: R**-! 1. [0,1] defined by 


g(t) = [= et]. 


(1) Using Problem III-1, show that there exists a continuous function f : 
(0, +00) — R such that 


fel) =f explite. nolo 


(2) Use Problems ITI-3 and II-4 to show that, for every s > 0, 


k 


I a ey 3"-1 F(u)du = Cy ff e “(1—cosu)dul , 
0 0 


where C;, is a constant. 
(3) Show that f > 0 and that fpox-1 f(a)dax < oo by using Problem [I-19 
and the sequence of functions f,, : [0, +oo) — R defined by f;(u) = 1—cosu 
and fnzi(u) = fi" flu — p)filp)dp. 

Conclude that y is the Fourier transform of a probability measure on 
R**~!. Compute it for k = 1 and k = 2. 
(4) Suppose that g : [0,+00) — R is continuous and satisfies the follow- 
ing conditions: (i) g(0) = 1; (ii) (—1)*~!g“-) (x) exists and is convex on 
(0, +00); and (iii) lim,4 9(z) = limz— 4.0 g*~ (x) = 0. Use Problem 
II-6 vorhow that g(|l¢||) is the Fourier transform of a probability measure 
on R**. 
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REMARK. The result of (4) for k = 1 is due to G. Polya (1923), and the 
general case to R. Askey (1972). 


Problem III-6. Let C denote the complex numbers. A function p: C — 
[(0, +00) is called a seminorm if 


(i) p(Az) = |A|p(z) for all \ € R and z € C, and 

(ii) p(z1 + 22) < p(21) + p(22) for all z; and zz in C. 
(1) Let p: C — [0, +00) satisfy (i). Prove the equivalence of the following 
properties: 

(a) p is a seminorm. 

(b) {z: p(z) < 1} is a convex subset of C = R’. 

(c) For all a1, a2, a3 such that a; < a2 < ag and a3—a, <7, 


(iti) p(e’*) sin(az — a1) + p(e"®?) sin(ag — a2) — p(e'®?) sin(a3 — a1) > 0. 
(2) Let js be a bounded positive measure on (0,7). Show that 
us 
(iv) pp(z t+ ty) = i |x sina — ycos alju(da) 
Jo 
defines a seminorm. Show that p, = py, implies w= (11. 


METHOD. Observe that p(e”) is the convolution of 4 and |sin@| in the 
group R/rZ. (See IHI-1.8.) 
(3) Let 0 < ay < ag <...4) < 7, with the convention that @9 = @, — 7 


and Q,41 = a, +7. The matrices A = (ajj)Pj21, B= (bis )Pjat> and 
D = (dij)}';=; are defined as follows: 

a;; = |sin(a; — a;)| for all 7,7 = 1,....n. 

b;; = —sin(aj41—Qi-1), bi.i41 = Sin(a;—a;—1) (with the convention that 
Dante = bn.1); bii-1 = Sin(ai41—@;) (with the convention that 61.9 = bin) 
fori =1,...,n, and bj; = 0 otherwise. 

dj; = 2sin(aj;41—a;) sin(a;—a;—1) fori = 1,...,n and d,; = 0 otherwise. 

Verify that AB = D. If p = Y0)_) mjba,, where m; > 0 and dq, is the 
Dirac measure at a; for 7 = 1,2,...,n, calculate p(e’’) and verify that 
(v) [m™m1,M2,-.-,MnlA= [pate Mh per?) che, pe )] ; 


(4) If p is a seminorm, show that there exists a bounded positive measure 
pon (0,7) such that p = p,. 


Metuop. Let T = {a),...,Qn} with ag =a, —-7< 0S a1 <...< an < 
T < Qn41 = 01 + 7. Show that there exists a seminorm pr such that, if 
0<A<landj=1,...,n, 

(vi) pr [Ae + (1 - djet'+1] = Apr [e’*] + (1 — A)pr [eto] , 


and show by using (3) that there exists zr concentrated on T such that 
PTY = Pur: 
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Next, let a; = Goin and set pn = pr and py, = pr. Show that p = 
limn—co Pn and that there exists a bounded positive measure yu on [0, 7) 
such that py, converges vaguely to ps as n — oo. 


REMARKS. A consequence of (4) is that every seminorm on R? can be 
approximated by finite sums of the type ay |a;x + b;y|, and not only by 
sup; |a;x+b;y|. For R” with n > 2 this is false; in general, a seminorm can 
be approximated only by suprema of absolute values of linear functionals. 


Problem III-7. Let C be the set of complex numbers, identified with 
R’, and let p be a seminorm on C. Show that exp(—p(t)) is the Fourier 
transform of a probability measure on R?. 


METHOD. Use the fact, proved in Problem III-6, that there exists a sequence 


of measures 4, > 0 on [0,7), concentrated at a finite number of points, 
such that 


vim 
p(z +ty) = tim, |zsina — ycosa|un(da). 
oo Jo 


Also use the formula e7!t! = fc 
III-4. 


oe ete a) which appeared in Problem 


REMARKS. This result is due to T. Ferguson (1962). It is false in higher 
dimensions; only for certain norms (like the Euclidean norm) is exp(—p(t)) 
the Fourier transform of a probability measure. See Problem III-8 for a 
counterexample. 


Problem ITI-8. (1) What is the image v in R, under the projection 
(0,--.,2n) + Xo, of the measure exp(— max;=o,....n |v;|)dxodz, ...dxy 
in R”? (See Problem I-14.) 

(2) Compute the Fourier transform of v. 


METHOD. Show that k!(1 — it)-(#*) = f° ck exp(—a + ita)dx for t real 
and k a nonnegative integer. 

(3) Conclude that yn4i(t) = exp(— max;<=o,....n |tj|) is not the Fourier 
transform of a probability measure on R"*! when n > 2. 


REMARK. (3) is due to C. Herz (1963). 

Problem III-9. Let E be n-dimensional Euclidean space. 

(1) Ifa > 0, 6 > 0, and a+ < n, show that there exists a constant 
K (a, 8) such that I(y) = fr llell°"" lly — all?" de = K(a, B)|lyllete-. 


METHOD. Use Problem III-3. 
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(2) Let 0 < y < n and let M, be the set of positive measures jp, not 
necessarily bounded, such that fl L) = Joy p lle — yl?" u(da)u(dy) < 00. 
Show that, if ~ and v are in M,, 


Lf le- uh mtdeye(an) = VFGOTO. 


Problem III-10. Let M be the space of real Radon measures on U = 
{z : z € C and |z| = 1} and let F* (respectively F~) be the vector 
space over R of complex functions defined in {z : |z| > 1} = D* (resp. in 
{z:|z|< 1} = D7). For p € M, we define 


2) = te" —z)"!du(e) for z € Dt, 
fil= he —z)~ldu(e) for z€ D>. 


(1) Show that the linear mapping y»+> f* from M to F* is injective. 


METHOD. Expand ft in a power series in 1/z. 

(2) Find the kernel of the linear mapping +> f,, from M to F™. 
REMARKS. 1. Although ii determines 1, f,, does not. 

2. The situation is completely different if uw is complex, since there exist 
complex measures, like dy(e'®) = e~'"d@, for which fi(n) = 0 for all n > 0. 
Problem IJI-11. Let P(a,..., ,t,) = P(x) bea paepecuaale polynomial 
of degree m in n variables which is harmonic; that is, ane oa YP (x) = 0 for 
all z in R”. For a fixed o < 0, let 


et —n —|\x||? : 2_ 
(a) = (oV2r)~" exp 362 Plz) with: | |la|P = ee: 


Show by induction on m that there exists a number A,,(o) such that 


oa —g? 2 
Fi) = Kmla) P(t) exp (=) | 


METHOD. mP = Ss 
Problem III-12. The goal of this problem is to prove the following in- 
equality of S. Bernstein: If yz is a complex measure on [—a, +a], then 
ji’(t)| < asupyer |H(s)]- 
(1) Consider the odd function h(0) of period 27 defined by h(@) = @ if 
0<60< 7/2 and A(0) =x7-Oifn/2<O0<T. 

(a) Compute v, = (2im)7" gi h(0) exp(—in@)dé for n in Z. 

(b) If v is the measure defined on R by v = 4g Unbn, where 
6, is the Dirac measure at n, show that v is bounded and that h(6) = 


i (cer exp(iz@)v (dz). 
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(2) If 4 is a complex measure on [~7/2, 7/2], let 


=o Uh 


m{2 
f(t) = i. PEE) HL A) 


(a) Show that f(t) = (f * v)(t) for all real t. 

(b) If = fo = (2%)~'(6z — 6_=), deduce from (a) that ro (2k = 
1)? = 17/4, 

(c) Returning to the general case, deduce from (a) and (b) that 


[f’(t)| < 3 up lf(s)| for all ¢ in R. 
Ss 


Show that equality holds if and only if 4 is concentrated at the points 
zn /2. 
(3) Prove Bernstein’s inequality and discuss in detail the case of equality. 


Problem ITI-13. Let f : (0,+00) — R be measurable and satisfy 
f(ct+y) = f(x) + f(y) for all x and y > 0. 


(1) If y(t) = if. explitf(ax)|dx for t € R, show that y +> y(t) exp[itf(y)] is 
continuous on (0,+00) and conclude that f is continuous. 
(2) Show that f(x) = af(1) for xz > 0. 


Problem ITI-14. Let E be a real vector space of finite dimension n and let 
E be its dual. Let €1,-.-,€n bea basis of &. The dual basis ej,...,e7, of E 
is defined by (e;,e*) =O if 7 #7 and 1 if j = 7, where (, ) is the canons 
bilinear form on E x E. E and E are equipped with Lebesgue measures dx 
and dt, respectively, such that, if f € L'(E,dx) implies f € L'(E£), where 
f(t) = LE exp(i(x,t))f(x)dx, then f(x) = (27)7” Jgexp(—i(z, t)) f (t)dt. 
Let Z denote the set of points z = St zie; of E such that the z; are 
integers and let Z* denote the set of points ¢ = $7", Ge? of E such that 
the ¢; are integers. 

Prove Poisson’s formula: 


If f is in the space S of infinitely differentiable functions of 
rapid decrease, then for every t in E 


S_ f(2n¢ +t) = [vol(ef,...,e%) er) S* F(z) een 

¢EeZ* z€Z 
METHOD. Show that >°,-7|f(z)| < oo and use Theorem III-4.2 to see 
that the left-hand side w(t) of the equation exists. Observing that the set 
of periods of w contains 27Z*, compute the Fourier coefficients of 1. 


REMARKS. 1. With the above hypotheses on the choice of dt on EB, it can 
be shown that 


vol(€1,..-,@n) x vol(e*,...,e%) = 1. 
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Without loss of generality we may assume that vol(e1,...,é€,) = 1. Let 
E be given the Euclidean structure such that (e),...,én) is orthonormal; 
then E can be identified canonically with EF, é5 = e,;, and dz and dt are 
identical. 

2. Poisson’s formula is also valid in some situations that differ slightly from. 
that where f € S(E). One of these occurs when f € L'(E), f > 0, and f 
has compact support. 

3. A striking application of Poisson’s formula is that if 


+00 
g(a) = Vo S- exp(—o?1n’), 


n=—O 


then g(a) = g(o~'). To prove this, it suffices to take E = R, e, = 1, and 
f(x) = exp(—27?2?/o7). 


Problem III-15. Let EF be a real vector space of dimension n > 0, let E 
be its dual, and let E be equipped with Lebesgue measure dz. It is always 
true that E = E. The canonical linear form on E x E is written (, ). We 
consider the following operators, where a € EF, b€ EB, c (respectively d) is 
an invertible linear mapping from £ into E (resp. from E into BE), and ‘ce 
(resp. ‘d) is the transpose of c (resp. d). 

For f € L°(E), 


Taf(z) = f(e-a), Mof(z) =e" f(x), Hef(x) = f(e*2), 


and Uf € L?(B) is the Fourier-Plancherel transform described in HI-2.4.9. 
For g € L7(E), 


Thg(t) =g(t—b), Mag(t) =e" g(t), Hag(T) = g(d7't), 


and Vg € L?(E) is the Fourier-Plancherel transform. 
Prove the following formulas: 


UT, = MU 


1) VT; = M,V 
2) UM,=ThU 
3) 


(1) 
(2) VM, =T.V 
UH, = |det c|H(ee)-1U (3 ) VAa => |det d|Hitay-1V 
4) (H_1,U)(f) = UCP) ie apna) 

5) U-l=(2n)"H_y.V (5) Vol = (20) "H_y,U. 


NN 


(Here 1g and 1 yj are the identity operators on & and E , respectively. 


Problem III-16. Use the result of Problem IV-12, 


= iat a 
| go-le-etiet SF (1—it)"° forte Randa>0, 
0 (a) 
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with the convention for z* with Rez > 0 made in Problem IV-12, to com- 
pute the Fourier-Plancherel transforms of the following functions in L?(R): 
(1) la|2-e-*1)0,4.00)(@) 

(2) |z|°~ te? 1(—c0,0)(2) 

(3) |a|*~*e* 

(4) —é- sign(x)|a2|¢~te7!#! 

(5) (c —a—ib)~”, with n a positive integer, a and 6 real, and b #0 

(6) b(a? + b?)~! 

(7) a(x? + b?)~! 

(8) f(x), where f(z) is a rational function with no real poles and without 
entire part. 


METHOD. For (5), use (1) and problem IIJ-15. 


Problem III-17. Compute the Fourier-Plancherel transforms of the fol- 
lowing functions: 

(1) 1-141) (2) 

(2) 1a, (x) 

(3) sina/x 

(4) sin? 2/2? 

(5) (1 — |2|)* 

(Here at = max{0, a}.) 

Problem III-18. If f € L?(R) and (Uaf)(t) = fe’ f(a)dz, show that 
limgoo Ua(f) = U(f), where U denotes the Fourier-Plancherel transform 
of 


Problem III-19. If f and g are in L?(R), show that 


[ t@a@ar = [ Fo)a(evae. 


METHOD. Use the fact that L'M L?(R) and A(R) are dense in L7(R). (See 
Il-2.4.7.) 


Problem III-20. Let g,(z) = i(signr)e~"l"! for b > 0, let U be the Fourier- 
Plancherel transform in L?(R), and let M,, be the operator on L?(R) 
defined by My, f(x) = go(x) f(z). Set 


Hy = U7'M,,U. 
(1) Show that Ha f(x) = 1 re gi f(x — y)dy for almost every 2, if 
f € L?(R) and b> 0. 


METHOD. Use Problem III-16(7) to compute U(g,), then apply Problem 
IIL-19. 
(2) If f € L?(R), show that Hof = limp, 4 x ee x EP 4, f(x — y)dy exists in 


L? and give its Fourier transform. Also calculate 2. 


(Hof) Purely Real 
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REMARK. Hof is called the Hilbert transform of f. 
Problem III-21. Suppose that f € L?(R) and g € L1(R). Show that 


+00 
h(x) = / flu y)gly)dy 
J~x 
exists for almost every x and defines a function h in L?(R) such that 
lhllzz < [Ff llz2ilgilp: and h = Gf (where g is the Fourier transform of 
g € L' and hand f are the Fourier-Plancherel transforms in L?). 


METHOD. Apply the Cauchy-Schwarz inequality to | f(«—y)| |g(y)|!/2 (con- 
sidered as a function of y) and |g(y)|!/? and use Problem IL-18. 


Problem ITI-22. Let 0 < «<a and let gea(y) = (ty) 7! Lye<jyj<a}(y)- 
(1) Compute lim, limg— +50 Ge.a(t). where g-.q is the Fourier transform 
on L!(R) of gq. (Use Problem II-16.) 

(2) For f € L?(R), we set 


ai fdy 
ee) 3 eee u—y 


(This equals f * g.. in the sense of Problem III-21.) Using Problems HI-20 
and III-21, show that lim, lima.++ He.a(f) exists and coincides with 
the Hilbert transform of f (Problem HI-20). 


Problem IIJ-23. A function f in L?(R) is called hermitian if f(x) = 
f(—a) and skew hermitian if f(x) + f(—ax) = 0. Let f denote the Fourier- 


Plancherel transform of f and let Hof denote the Hilbert transform of f. 
(See Problems III-20 and II-22.) Prove the following statements: 


Ae Hermitian Skew- Real Even 
hermitian 


Purely 
imaginary 


f is Real Purely Hermitian Skew- Even 
iff imaginary hermitian 


Odd | Even 1 
| 


Skew- 
hermitian 


Hermitian 


iff imaginary 


Odd | Even 


— 


Real Purely 
imaginary 


Hermitian 


(Hof) is Skew- 
hermitian 


Problem III-24. Compute the Hilbert transform (see Problem II-20) of 
each of the following functions: 


ile) = pape fox) = sate 
fol) = Frye),  falz) = Plog) 24), 
fs(z) = G-|al)*, fe(z) = +log|##4| + 4 log wel 
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METHOD. Use Problem III-16 for f2 and Problem III-22 for fe. 
For fs, at = max{0, a}. 


Problem III-25. Let S be the vector space of C® functions on R which, 
together with all their derivatives, are of rapid decrease. 

(1) Show that if f € S, then lim,_.9 ase £2) dy exists and defines a con- 
tinuous linear functional (or “tempered distribution”) on S. 

(2) Show that the Fourier transform of the distribution defined in (1) is 
the Radon measure p(dt) = in(sign t)dt. 


METHOD. Split the first integral into {e < |z| < 1} U {|a| > 1}. Also use 
the fact, proved in Problem I-16, that ye dr = 5. 


Problem III-26. Let I = (a,b) and let f € L'(J). 

(1) If F(z) = f* f(t)dt for x € I, show that F’(x) = f(x) in the weak 
sense (IIJ-3.3.1). 

(2) Fe a and F’ = f in the weak sense, show that, fora<a< 6 < 
b, 


B 
[ fea = FB) - Fa). 


(3) Let s be a positive integer. Show that F is in Hj, the local Sobolev 
space (see III-3.5.6), if and only if there exists f € L?,.(1) such that the 
weak derivative of order s—1 of F exists in the ordinary sense and satisfies 


BENG) Sea +f soar 


for all aw and z in J. 


Problem III-27. Let f € L?(R), with Fourier-Plancherel transform f. 
Prove Hermann Weyl’s inequality, 


0 2 oo 0° 
[venta] <2 fipeyparx f eifora 


and analyze the case of equality. 


METHOD. Without loss of generality, assume that f is in the Sobolev space 
H'(R). Show that [72° |f(x)?de = —2Re [7° xf(x)f’(a)dz, with the 


co 


help of Problem I-15(2). Conclude by using the Cauchy-Schwarz inequality 
(Problem I-12). 


REMARK. This inequality has an interpretation in quantum mechanics, 
where it is known as Heisenberg’s uncertainty principle. ° 


3H. Weyl, The Theory of Groups and Quantum Mechanics (London: Dover, 
1931). 
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Problem IV-1. The points marked on the faces of two dice are, respec- 
tively, for the first: 1, 2, 2, 3, 3, 4; for the second: 1, 3, 4, 5, 6, 8. 

If X is the sum of the points obtained by throwing the two dice, compute 
P|[X = k] for integer k. Answer the same question for ordinary dice. 


Problem IV-2. The random variable X is called a geometric distribution 
with parameter p, 0 <p <1, if 


P[X =k] =(1—p)*"'p, k=1, 2, 3,... 


Compute E(.X) by using Problem I-6(1). 


Problem IV-3. Suppose that 6, is the Dirac measure at a, p € (0,1), and 
> 0. Consider the following two probability measures on N: 


VY = (1—p)bo + p61 (Bernouilli distribution with parameter p) 
[ies ee, ee eA’ 6, (Poisson distribution with parameter A) 


(1) Show that the vague limit of the sequence {v3"}n5) is uw, and that 
Bd, * dg = bd, 422- > 

(2) Let 0 < p < 1. Consider the measure m, on N? concentrated at the 
points (0,0), (0,1), (1,1), and (k,0) with k > 2 (note the absence of (1, 0)), 
such that X has distribution 4, and Y has distribution v, if (X,Y) has 
distribution m,. Compute m, and conclude that P(X # Y) < 2p’. (Use 
the fact that e~? > 1 — p.) 
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(3) If (X,Y) is an arbitrary variable in N? and A CN, show that 
[P(X € A)— P(Y € A)| < P(X #Y). 


(4) Let (X1,¥1),...,(Xn, Yn) be m independent random variables with val- 
ues in N? and with distributions Mp,; Mpy.--+, Mp, Let A CN. Use (2) 
and (3) to show that 


[P(X +--+ Xn € A)— P(¥i +--+ +¥n € A S25) 7. 


jal 


(5) Ifm > A and A CN, show that 


*TL 2d? 
Win (A) — Br(A)| < Tat 


REMARKS. The approximation of the binomial distribution by the Poisson 
distribution is both elementary and essential for applications. (5) gives an 
upper bound for the error committed by replacing a binomial distribution 
vy, by a Poisson distribution pinp,, and (4) treats the case of experiments 
that are independent but not identical. This result is due to J.L. Hodges 


and L. Lecam (1960). 


Problem IV-4. On a probability space (0,A,P), we define a random 
variable N with positive integer values and random variables {Xp }n>1, 
with values in a measurable space (J, 8), such that the X,, all have the 
same distribution m but are not necessarily independent. 

(1) Show that the distribution yz of Xy is absolutely continuous with respect 
tom. 

(2) If f(z) = a (x) and a > 0, show that 


BN") 2 ay fF e)am(ay, 
METHOD. If B(y) = {a € I: f(x) > y}, show that 


(i) w(B(y)) < S> P[Xn € Bly) + PIN > yl 

nsy 
and use Problem I-6. 
(3) Show that 1+ E(log N) > f f(x) log f(x)dm(a) by letting a | 0 in (2) 
and using the monotone convergence theorem. 
Problem IV-5. With the notation of Problem IV-4, we take J = [0,1], 
B = the Borel algebra, and m = Lebesgue measure, and we assume that the 
{Xn}n>1 are independent. Let f : J — [0,+00) be a nonnegative measur- 
able function, bounded by a number } > 1, which satisfies c f(a)dx = 1. 


Let 
N = inf{2n: bXan-1 < f(Xan)}- 


Show that Xj has density ae =f. 
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REMARK. This procedure for constructing a random variable of given den- 
sity f on [0,1] from uniform random variables was invented by J. Von 
Neumann in 1951. 


Problem IV-6. (1) Let Y be a positive random variable. Show that for 


all y > 0 


P(Y >y)< LEY) (Chebyshev’s inequality). 
y 


(2) Let X be a real random variable such that E(X?) < oo. If m = E(X), 
show that for all t > 0 


1 
P(|\X —m|>t)< p(x —m)*) (Bienaimé’s inequality). 


(3) Let {X,,}°2, be a sequence of independent real random variables with 
the same distribution and such that E(X?) < oo. If m = E(X1), show that 
for all « > 0 and for all a € [0, 5) 


?| 
(weak law of large numbers). 
(4) Let {X,,}°2, be a sequence of independent real random variables with 
the same distribution, for which there exists k > 0 such that Elexp k|X,|] < 
oo. If m = E(X,), show that for every « > 0 there exists q in (0,1) such 
that 


Xi ts +Xn 5] oan 0 


n 


p |e th 


=n > ( < 2q”. 
n 


Conclude that +(X, + ---+X,) — m almost surely as n — oo (strong 
law of large numbers). 


METHOD. Show that m = 4[E(exp(sX1))],=0 and apply Chebyshev’s in- 
equality to Y = exp(s(X, +--+ X,)). 


Problem IV-7. Let {X,}°2, be a sequence of nonnegative real random 
variables with the same distribution, such that X,; and X,, are independent 
for every pair (j,n) with 7 # n. Assume that E(X,) < oo. Set S, = 
jai Xi Yn = Xn ex, <n}, and $7 = ae Y;. The goal of this problem 
is to prove the law of large numbers: 


Pine ae) ak 
no nN 


(1) Using Problem I-6, show that E(X,) < oo implies 0, P[Xn # Yn] < 
oo. Using the Borel-Cantelli lemma (I-5.2.8), conclude that lim, —.o(Sn — 
S*) exists with probability 1. 

(2) Show that limn... 4E(S*) = E(X1). 
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(3) Let @ be a real number greater than 1 and let k, be the integer 
part of a”. Prove the existence of a constant C; such that S“{k7? 
n such that ky, > 7} < C\j~?. With the help of Bienaimé’s inequality, 
conclude that 


| || > < 2 L pe 


Then prove that )77") 3 2 E(Y? )<o. 
(4) Deduce from (2), (3), and the Borel-Cantelli lemma that 


then from (1) that 


(5) Prove that, for every a > 1, 


n—- CO 


Pa “1E(X1) < lim inf = < limsup = < oB(X;)| = 1 = 


Deduce the law of large numbers from this. 


REMARKS. The elementary proof whose outline is sketched here is due to 
N. Etemadi (1981). 


Problem IV-8. Let {Xy,}°2, be independent real random variables with 
the same distribution and such that E(X,) = 0 and 0 < E(X?) < oo. Let 
Sn = X1 +--+ Xq. 

(1) Show that limp... P(S, > 0) = $ by using Laplace’s theorem (IV- 
4.3.1) and Problem [I-22. 

(2) Use the preceding result, and the weak law of large numbers proved in 
Problem IV-6(3) (that limy_.o6 PI > €] = 0 for all € > 0) to show that 


limn—oo [E(exp(—S,)1,5,, >0})]1/” = 


Problem IV-9. (1) If X and Y are independent real random variables, 
show that ae +Y >a+b)> P(X >a)P(Y > b) for all real a and 6. 
(2) Let {X,,}2, be a sequence of real independent random variables with 
the same iste banon: and set Sg = 0 and S, = X,+---+Xy. Let s bea 
fixed real number. Set py = P[S;, > ns]. Show that prim > PnPm for all 
m,n > 0 and that, for n > 0, p, = 0 if and only if p) = 0. 

(3) If the sequence {a,}%9 of nonnegative real numbers is such that 
AQntm 2 Gn + Gm for all m, n > 0, show that lim,—.. “ = infyso 4. 


Conclude that limp—o WPn = a(s) exists. 
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Problem IV-10. Let {X,,}°2, be a sequence of independent real variables 
with the same distribution. Suppose that y(t) = E(exptX1) exists for all 
t in an open interval J containing 0 and fix a real number s > E(X) 
such that t + e~‘Sy(t) attains its minimum a(s) at a point 7 of J. Let 
Sy = Xypt--- + Xn. 

(1) Show that log y(t) is convex on J and that 7 > 0. Conclude, using 
Chebyshev’s inequality (Problem IV-6), that 


o(S20)f st 


(2) Let yy be the distribution of X1 — s and let v(dz) = oa (de). Prove 
that v is a probability measure, that fxv(dr) = 0, and that f 2?v(dz) < 
oO. 

(3) Let {Z,}92, be a sequence of independent random variables with the 
same distribution v. Show that 


P |= > | = (a(s))"Blexp(—r(Zi +--+ Zn))1z,4-+2020}): 


Conclude from Problem IV-8 that 


(4) Compute a(s) in the following cases: 
(a) p(t) = exp(t?/2) (normal distribution) 
(b) y(t) = cosh(t) (Bernouilli distribution) 
(c) y(t) = (1-t)7*%,t<1,a@>0 (gamma distribution) 
(d) y(t) = exp A(e’ — 1), A>0 (Poisson distribution) 
(e) p(t) = (2a), ptq=1,0<p<1,t<-—logg,a>0 
(negative binomial distribution) 
(f) p(t) = z4z, lt] <1  (Laplace’s first distribution) 
(g) y(t) = =4; (logarithm of a Cauchy distribution) 


REMARK. It is not known what conditions on a decreasing function a on 
R are sufficient for the existence of a distribution yz of the X, such that 


dim |P (= > 5) eon 


Problem IV-11. If z; and zg are complex numbers with positive real 
part, set I(z1) = [f° a™~le"*dx and B(z, 22) = fo x-l(1 — 2)?2—1 dz, 
Assume without proof the formula 


P(2)P (2) 


B = i 
(21, 22) (24 ne Z2) 
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If a and 6 are positive, the probability measures 


pce AOE 
Ya(dx) = 1(9,420)()a*~'e~™ [(a) 
38 p et a 
Bap(de) = Lo1y(@e)a*~ (1 — 2) B(a,b) 
dx 


2) a Chek, 
pO} (de) = lossy ee iar 


B(a,b) 


are called, respectively, the gamma distribution with parameter a and the 
beta distributions of the first and second kind with parameters a and b. (1) 
If isa pounded measure on (0, +00), its Mellin transform is (My)(t) 

— Safe x'' (dx) for t real. (This is the Fourier transform of the image of ju 


under x + loga.) Compute My,, Ma», and MB). (2) If X is a random 
variable with distribution 3,,, compute the distribution of X/1—X. (3) 
If X and Y are independent r.v. with distributions y, and y, compute the 


distributions of X/Y and X/(X +Y). (4) If X, Y, and Z are independent 


2) 


2 tb.c: compute the distributions 


r.v. with distributions 64.5, Ba+y.c, and gi 
of XY and XZ. 


Problem IV-12. (1) Let yg be the probability measure of Problem [V-11, 
with a > 0. Compute its Fourier transform. If X and Y are independent 
random variables with distributions y, and +, compute the distribution of 
X+Y. 

(2) Let X be a Gaussian random variable with density e Tz 


eo 8? /207 dx. 
9 \it 
Compute E \(#2) | for t real, and use Problem IV-11 to find the distri- 


bution of a 
(3) Let Xy,...,Xa, Yi,..-, Ym be independent random variables wt the 
same distribution as X of (2). Compute the distribution of 54z[X7 + 

XP +NG 
Y, zee +Y2 


m 


.. + X39] by using (1) and (2), and the distributions of and 


AG by using Problem [V-11(3) 
XP f+ X24Y2 4. FV? : 


Saeen IV-13. In Euclidean space R?*!, consider a random variable 


= (Xo, X1, , Xa) whose distribution py is invariant under every orthog- 
al matrix of Rel and une es - y Let v denote the distribution 
of X on (0,-+00) and let ¥Y = (2 x, oS eee a 


(1) Use Problem III-2 to show that ‘the dist sbutan of Y is independent of 
Vv. 

(2) From now on, assume that the {X,}'/_ are independent, with the same 
distribution 2 and with Fourier transform exp(—§). Show, using Problem 
I[J-1, that js must be invariant under every orthogonal matrix. 
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(3) If a € R, compute the integral 


I(a) = = [- exp |-5 ( + =) dx 


by using the following fact from Problem II-12(1): 


+00 \a| +00 f 
/ f(z -— —)dz = / f(y)dy for every f integrable on R. 
—oo x —oo 

(4) By first conditioning with respect to Xo (see Problem IV-34), compute 
the Fourier transform of the distribution of Y. 

(5) Using Problem IV-11, find the distribution of |/Y||?. Derive the density 
of Y from this, by observing that the distribution of Y is invariant under 
every orthogonal matrix in Og and using Problem III-3. 


Problem IV-14. Let yq be the probability measure of Problem IV-11, 
with a> 0. 
(1) Use Problem IV-12 to compute limg—.oo fy~ exp[it(*)}ya(dz). 


va 
(2) Using Problem I-14, show that 


~ 2-4 to 2,5 ax 
lim — dx) = gies (2. 
slim, [OP lnlde) = fle 


(3) Integrate by parts to compute the integral ie |= lva(de) and prove 
Stirling’s formula: 


qt 1/2e-4 1 


lin, es Se 
(ares I'(a) V2 
Problem IV-15. (1) Let « be a probability measure on R such that 
L(t) = fé(t cos @)ji(tsin @) for all real t and 6. Show that there exists o > 0 
such that f(t) = exp(-Z#). 


METHOD. Show that p(t) > 0, then that f(t) > 0 for every t. Finally, 

consider f(x) = —logfi(./z) for xz > 0. 

(2) For positive integers d, and dz, let 4 and pz be probability measures 

on the Euclidean spaces R® and R® such that v = Ly ® pl is invariant 

under the group G of orthogonal matrices on R“ +“. Show that there 
2 

exists 7 > 0 such that f2;(t) = exp(— ltl), 7 = 1, 2, where ||¢||1 and ||¢\l2 

are the norms in R® and R®. 

METHOD. Use Problem III-1 and part (1) of this problem for the case where 

d, = d2 =1. 


REMARK. The converse of the property in (2) is trivial. This characteriza- 
tion of centered normal distributions is sometimes called Maxwell’s theo- 
rem. 
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Problem IV-16. A real random variable Z is called symmetric if Z and 
—Z have the same distribution. 
(1) Show that Z has distribution c(dz) = Fee if and only if Z is symmet- 
ric and |Z|? has distribution 3) (1/2, 1/2). Ose Problem IV-11.) Assuming 
without proof the formula [(z)?(1—z) = =*= for complex numbers z such 
that 0 < Rez < 1, compute E(|Z|‘‘) for real t in this case. 
(2) Let X; and X» be two real random variables that are independent 
and symmetric, and have distributions 4; and pl such that j)({0}) = 
2({0}) = 0. Show that Z = ¥ has distribution c in the following cases: 
(a) pu (dz) = io(dr) = exp(—2?/2)dx/ vIn 
(b) |X|? has distribution 6(3,6) and |X| has distribution 3°) (4, $+0) 
(c) ju (de) = pa(de) = V2/rde/(1 +2") 
- S Giory at X41 
(3) With X; and X2 as in (2), deduce from (2a) that U (AS 


) is uniformly distributed on the unit circle of Euclidean space 


X2 
V XTX 


R? if and only if Z = 42 has distribution c. 


REMARKS. Example (2c) is due to tale (1949). Moreover, if (X 1, X2) is as 
in (2) with U uniform, then ( xy : XS =~) has the same property. 


Problem IV-17. A probability measure v on a Euclidean space R4 is 
called isotropic if v({O}) = 0 and the image of vy under the mapping 
a Te in the unit sphere Sj,_; of R’, is the unique rotation-invariant 
probability measure og_; on Sg_1. It is called radial if its image v, in R 
under the mapping x +> (a,x) does not depend on a when a ranges over 
the unit sphere. 
(1) Let fz, and jz be probability measures on the Euclidean spaces R® and 
R“?, with d; and dz positive. Show that the probability measure v = 14,®p12 
on the Euclidean space R“ +4 is isotropic if and only if 4; and pa are 
radial and if, for every a, in Sg,-1 and a2 in Sq,—1, the image of v under 


a . d. 
(1,02) > $222 is c(dz) = sytem. 


METHOD. Prove the assertion first for d; = dy = 1 and use Problem IV-16. 
(2) Let (X1, X2, X3) be three independent random variables such that the 
distribution v of (X1, X2, X3) in R® is isotropic. Show that there exists 
a > 0 such that 


ot? 
Elexp(itX,)] = exp (-S) for 7 =1, 2,3, andteER. 
METHOD. Apply (1) to the distributions yz, of X; and pig of (Xz, X3) and 


use Problem IV-14. 


REMARKS. The converse of (1) is true but rather lengthy to prove. (2) is 
true for n independent random variables, n > 3; this follows easily from 
the problem. (Problem IV-16 showed that this would be false for n = 2.) 
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This property of the normal distribution is due to I. Kotlarski (1966), who 
proves it with the additional hypothesis that the X; are symmetric. 


Problem IV-18. Let E£ be a finite-dimensional real vector space, let E* 
be its dual, and let (x,t) be the canonical bilinear form on E x E*. If us is 
a probability measure on £, its Fourier transform is defined on E* by 


ji(t) = I exp(i(z, t))4e(da). 


(1) If there exists to # 0 such that |ji(to)| = 1, show that yu is concentrated 
on a countable union of affine hyperplanes and determine them. 


METHOD. First consider the case where fi(to) = 1. 
(2) If there exists a probability measure v on E such that ji(t)p(t) = 1 for 
every ¢t in E*, show that w and v are Dirac measures. 


METHOD. First prove this when dim E = 1. 


REMARKS. This result can be generalized by replacing H and E* by a 
locally compact abelian group and its group G of continuous characters x. 
(See III-1.4.) 


Problem IV-19. Let X,, X2, Y;, and Y2 be independent real random 
variables such that Y; and Y2 are strictly positive and E[exp(itX;)] = 
exp(—t?/2) for j = 1,2 and t real. Let R = [X?Y? + X2Y?]1/?. Using 
Problems IV-16 and IV-18, find the distributions of Y; and Y> such that 
U = (X1Yi/R, X2Y2/R) is uniformly distributed on the unit circle of R?. 


Problem IV-20. Let og_; be the uniform probability measure on the unit 
sphere Sqg_, of the Euclidean space RZ, and let vg be the image of wg under 
the dilation + Vdz. 

Prove that vg converges narrowly to v(dx) = exp(—2x?/2)dz/./2r. 


METHOD. If Yj,..., Ya,... is a sequence of independent random variables 
with the same distribution v and if Ra = [Y? +---+ Y?]!/?, use the fact 
that og—1 is the distribution of sie Oe Y2,..., Yq), the weak law of large 
numbers of Problem IV-6, and Problem I-10. 


REMARK. This property of uniform distributions on spheres is known as 
Poincaré’s lemma. 


Problem IV-21. Let S,, denote the set of probability measures y on R 
such that there exists a probability measure jz, on the Euclidean space R” 
whose image in R under x +> (a,2) is ys for every a in the unit sphere 
of R”. Prove that yw € N°L,S,, if and only if there exists a probability 
measure p on [0, +00) such that the Fourier transform of yu satisfies fi(t) = 


i exp(—““) (dy). Prove that such a 9, if it exists, is unique. 


METHOD. For the uniqueness of p, use Problem IJ-20. For its existence, use 
Problems ITI-1, ITI-2(4), and IV-20, as well as Paul Lévy’s theorem on the 
convergence of distributions. 


306 Exercises for Chapter [V 


REMARK. This property is due to I. Schoenberg (1937) . 


Problem IV-22. Let (Xo, .X1....,Xq) be an R?*! valued random variable 
that is radial, i.e. whose distribution is invariant under the group O41 of 
dx d orthogonal matrices. Let t = (t1,tz,...,tq) and |Jt|] = [2 +---+4#3]2. 
Prove that Efexp(i ee t;Xj — ||t|Xo)] = 1 for every t in R% such that 
Elexp(—||t||X0)] < 00. 


METHOD. Prove the assertion first for d = 1 and , concentrated on the 
unit circle. 


Problem IV-23. Let {(V,,.W,)}°, be a sequence of ea random 
variables with the same distribution, with values in R x R@ (where R” has 
the Euclidean structure), and satisfying E[log |V; |] < 0 and Eflog* ||W,||] < 
x. 

(1) Prove that S>>* 


r_o Mi +--+ Vn} ||Wr4il] converges almost surely. 


METHOD. Use the Borel-Cantelli lemma to show that limsup,,_,., ||Wn|[!/” 
<1, then use the strong law of large numbers. (See Problem IV-7.) 

(2) Let yx be the distribution of oo R‘-valued random variable which is 
equal to the sum of the series ee 9 M1 ---VnWnsi. Let v be a distribution 
on R¢ whose Fourier transform 7 a 


p(t) = E[0(Vt) exp(i(W),t))] for every t in R¢. 


Show that uw =v. 

(3) Let {U,,}2 5 be a sequence of independent R“*!-valued random vari- 
ables with the same distribution, the uniform distribution on the unit 
sphere S, of R@*!. Let V, — 1 and W,, be the projections of U, onto 
(R,0,0,...) and ee its orthogonal complement. Prove that if yu is the 
distribution of S079 Vi... VaWn4i, then f(t) = exp(—|l¢||). 


METHOD. Use (2) and Problem IV-22. 

(4) Let {X;, }>, be a sequence of independent random variables with values 
in N = {0,1,2....} and with the same distribution, such that X, satisfies 
pr = P[X, = k| < 1 for every & in N. Set qh = P[X1 < hk]. Show that 
if w is the distribution of Seaee PX, PX» -+-PX,9X,4,, then p is Lebesgue 
measure on [(), 1]. 


Problem IV-24. Let X and Y be independent random variables with the 
same distribution and with values in Euclidean space R%, d > 1, which 
satisfy the following conditions: (i) P[X = 0] = 0; (ii) hall and ||X|| are 
independent; and (iii) eal is uniformly distributed on the sphere Sy_}. 
(That is, the distribution of X is “radial” —- see Problem III-2(4).) Prove 
that 


— 


PIRX -YI<IVI< 3 
and that this inequality is the best possible. 
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METHOD. Consider R = Tae use the fact that R and R7! have the same 
distribution on (0,-+co), and prove the inequality by first conditioning with 
respect to |log R|. For the second part, take ||X|| with density Ag(-n)/n 
on (0, 1] and show that the distribution v,, of exp(—|log R|) tends vaguely 
to the Dirac measure at 0. 


REMARKS. 1. There is also an explicit expression, 
1 oO 
Puex-viisivil=7 f° cava), 
0 


where G(y) = mE ED Pe aaa and v*(da) is the distribution of 
A?/(1 — A?). 

2. This inequality is due to A.O. Pittenger, who proves it with the additional 
hypothesis P|||X|| = x] = 0 for all x > 0 (1981). 

3. Relaxing the hypothesis of the problem to P|||X || = 0] = 0 easily yields 
the upper bound 


1 
PU|2X -¥I| < VI <p+ (0-5, 


where p = P[||X|| = 0] < 1, and this again is best possible. Note also that 
P{||2X — Y]| < |Y ||] = (1 — p)?/4 < 1/4 in all cases. 


Problem IV-25. Let H be a separable Hilbert space and let py denote the 
orthogonal projection of H onto a subspace U. Define the Boolean algebra 
B of subsets B of H for which there exists a finite-dimensional subspace V 
of H and a Borel set By of V such that B = py'(By). Let o(B) denote 
the o-algebra generated by B. 

(1) Show that {z : |x|] <r} € o(B) if r > 0. 


METHOD. Use the fact that, since H is separable, there exists an increasing 
sequence {V,,}°2, of finite-dimensional subspaces of H such that US_,V;, 
is dense in H. 

(2) A cylindrical probability on H is given by probabilities zy on each finite- 
dimensional subspace V of H such that, if V; C Vo, the image of wy, under 
pv, is ty,. For B € B, let Eg denote the set of finite-dimensional subspaces 
V such that there exists a Borel subset By of V with B = p;'(By). Prove 
that V + py(By) is constant on Eg. Denoting this constant by (B), 
prove that p is finitely additive on B. 

(3) Consider the cylindrical probability defined as follows. Let p be a prob- 
ability measure on [0, +00) and let py be defined by its Fourier transform, 


foe) 2 2 
Av(e) = [ explite,t)uv(de) = [ exp(- Er) p(dy) forte V. 


Show that 4 is not o-additive on B if p({0}) < 1. 
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METHOD. Otherwise 4: could be extended to a o-additive probability mea- 
sure 4 on o(B). Use Problems I-10 and IV-6 to show that this would imply 


u({z : lla] <r}) = p({0}) for r > 0. 


Problem IV-26. In Euclidean space R”, consider the positive quadratic 
form q defined by q(x) = Sof) Anz, where x = {x,}"_, and Ap > 0. Set 


llall = Dopaa Ak: 
(1) If X is an R”-valued random variable such that 


Blexp(i(xX.0))) = ex(—EL), 


show that Plq(X) > r?] < Hall for every r > 0. 
METHOD. Use Chebyshev’s inequality, Problem IV-6. 


(2) Let ye be a probability measure on R” with Fourier transform ji(t) = 
Je exp(i(x,t))u(dz) and let e > 0 be such that |1 — fi(t)| < € for every t 
in R” with g(t) < 1. Prove that, for every r > 0, 


ye ex (- er) y(dr) >1—e- _ 


(3) Prove that, for every r, R > 0, 


2ilall R? 
u({z : ||al| < R} 2 1—e——]> — exp ~ Op? , 
Conclude that there exists a number R(||q||,¢) such that 


uC: |e] < Rillall,} 2 1 2¢. 
REMARK. This result is called Minlos’s lemma (1959). 


Problem IV-27. The notation is that of Problem IV-25 and pw = (uv )v 
is a cylindrical probability on H. A positive quadratic form q on H is a 
bounded linear mapping A : H — AH such that q(x) = (Az,x) > 0 for 
every x. If the dimension of V is n, there exist a basis b = {b),...,bn} 
of V and nonnegative numbers \j,...,A, such that, if ae tpbp is in 
V, then g(x) = S7p_, Anz. Moreover, the distribution of the {A,}R_, is 
independent of b, and we may set ||gy|| = S°¢_, Ax. This implies that 
llavi ll S llavell if Vi C V2, and we set ||q|| = supy |lqv || < +00. 

(1) Let fiy(t) = fi, exp(i(x, t))wv (dz) for t € V. Show that fry, (t) = fiv,(t) 
ifteE Vi NV. 

(2) Set f(t) = fiv(t) if t € V. Suppose that, for all « > 0, there exists a 
positive quadratic form gq. on H such that ||g.|| < oo and |1 — fa(t)| < € for 
all ¢ such that q¢.(t) < 1. Deduce from Problem IV-26 that, for all « > 0, 
there exists R(e) such that 


by ({2:a€V and |a|| < R(e)}) > 1-2 for every V. 
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(3) With the preceding hypotheses, prove that jy is a o-additive probability 
measure on the Boolean algebra B by showing that, if A, € B, An D Ans, 
and p(A,) > 6 > 0 for every n, then N&,An #0. 


METHOD. Let V,, be a finite-dimensional subspace of H containing a Borel 
set A’, such that A, = py, (Aj) and let B/,(R) be the closed ball of radius 
Rin V,. We may assume that V, C Vp41. Construct compact sets K/, of V, 
contained in A‘, B/,(R), introduce K, = py. (Kj,), and use the fact that 
Crh = Knn...NK, {zx : ||z|| < R} is a decreasing sequence of compact 
sets in the weak topology on H. 

REMARK. This result is due to Minlos (1959). 


Problem IV-28. Let {Xn }n>1 be a sequence of independent random vari- 
ables with the same distribution defined by P[X, = 1] = P[X, = -1] = 
1/2. Compute the limiting distribution as n — oo of 


Vo [Tate oi 7 ee | 


METHOD. Consider the characteristic function of Y,,. 

REMARK. This is a simple special case of Lindeberg’s theorem, which is a 
significant generalization of Laplace’s theorem, IV-4.3.1 (also often called 
the central limit theorem). Lindeberg’s theorem is stated as follows: If (i) 
the real random variables {X,,}°, are independent (but do not necessarily 
have the same distribution); (ii) for every n, E(X,) = 0 and o? = E[(X, + 
-++ 4+ X;,)°?] < 00; and (iii) for every e, 


DBE (*) 0, where f.(2) = 27 Le 420)(2). 


k=1 th 


E 


then the distribution of > (X,+---+X,,) tends to the Gaussian distribution 
N(0,1) as above. 


Problem IV-29. On the real line, consider the Gaussian distribution 
2 

u(dz) = x exp(4-)dx. Let L?() be the Hilbert space of functions which 

are square integrable with respect to yz, with the scalar product 


+00 = 
(f,9) = i: f(x)g(a@)u(de). 


The Hermite polynomials {H,,(x)}°2, are defined by 
ys HA, (x)(it)” = exp(ita + 5) =y(t,xz) VtEC. 
n=0 

Assume without proof that this implies 


a So [Hala] le" < exn(Iel lal + PY 


n=0 2 
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(1) By computing (y(t, .). e(s, .)) in two different ways, show that (Hy, Hm) 
= 0 ifn Amand that (AH,, Hy) = 4. Use the uniqueness of the Fourier 
transform to show that if f in L?() satisfies (f, H,) = 0 for every n, then 
f=0. 

(2) Show that H!_,(7) = H,(a) and that (n+ 1)Hn4i(x) = cA, (x) - 
Ay-\(x) ifn >1. 

(3) Let f € L?(u) and let f, = nl(f,Hn). Show that f = 75 fall 
(where the convergence of the ae . in Ms ae sense). If, moreover, f’ 
exists (in the sense that F(x) y+ Loh f'(t)dt for every x) and belongs 


to L? (2), show that f’ = =, ae 

METHOD. Compute (f’, Hp) by means of an integration by parts and (2). 
(4) Prove H. Chernoff’s inequality: If X is a Gaussian random variable with 
distribution p and if f is a real-valued function such that both f’E]] f’(X)|?] 
and E||f(X)|?] exist, then E[|f’(X)|?] > variance of f(X). Analyze the 
case of equality. 

Problem IV-30. Let (X,Y) be a Gaussian random variable with values 
in R? such that X and Y have distribution p(dr) = (20)~!/? exp(—5)dz. 
(1) For the Hermite polynomials defined in Problem IV-29, prove that 


H,(ycos@ + zsin@) = » Hi,(y) cos* @Hy—4(z) sin” * 6. 
k=0 


(2) Assume that cos@ = E(XY) 4 +1 and define the random variable 
Z= Ao Verify that Y and Z are independent and use (1) to 
in 
prove that E[H,,(X)|Y] = H,(Y)(E(XY))”. 
(3) Prove Gebelein’s inequality: If f € L?(j) with E(f(X)) = 0, then 
B[E[f(X)|Y])7] < (E(XY))PE(f?(X)). 


Analyze the case of equality. 
METHOD. Write f = )>~_, fn Hn as in Problem IV-29. 


Problem IV-31. Let H, be the nth Hermite polynomial described in 
Problem TV-29 and compute 


/ © ett Hy (aye PE = Felt) 
ae 2 n oe nb 


Use this to find 


P+ OG 
| eittyNe 2? (2g Ge dx 
a V2 


Problem IV-32. Let (Q,.A,P) be a probability space and let 8 be a sub- 
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a-algebra of A. We would like to show that if X € L'(.A), then 
(x) i. XdP 7 E[X|B]dP for all Be B, 
B B 


and that (*) characterizes E[X |B]. 

(1) Show that (*) holds if X € L?(A). 

(2) If X > 0, let L(X) = limy—.4o0 Elmin(X,n)|B]. If X € L(A), let 
L(X) = L(X*) — L(X~), where X+ = max(X,0) and X~ = max(—X,0). 
Show that L(X) € L'(B) and that [,(X — L(X))dP =0 for all B in B. 
(3) Show that if f, g € L'(B) are such that [,(f — g)dP = 0 for every B 
in B, then f = g. 

(4) Show that L(X) is a bounded linear operator from L1(.A) to L1(B) and 
infer that L(X) = E(X|B). 


REMARK. This characterization of conditional expectation is often taken 
as a definition in the literature. 


Problem IV-33. Suppose that, for every n > 0, X, € L'(.A) and X, > 0. 
Use the preceding problem to show that if X, 1 Xo, then 


Yn = E[X,|B] 1 E[Xo|B]. 


Problem IV-34. Suppose that (Q,.A, P) is a probability space, B is a sub- 
o-algebra of A, Y is a B-measurable random variable, and X is a random 
variable independent of B. Consider f : R? — R such that f(X,Y) is 
integrable. The goal of this problem is to show that if yu is the distribution 
of X, then 


+00 
(+) BUX. Y)B|= f f2,¥)ulde). 

(1) Show that (*) holds if f(x,y) = 1;(x)1;(y), where I and J are Borel 
subsets of R. 

(2) Let P be the Boolean algebra on R? consisting of sets of the form 
E = Us_1Iy x Jp, where I, and Jp are Borel subsets of R. Show that (*) 
holds if f(z,y) =1xe(2,y) with Ee P. 

(3) Let M be the family of Borel subsets M of R? such that f(x,y) = 
1u(x,y) satisfies (*). Show that M is a monotone class. 

(4) Prove (*) successively for the following cases: (a) f is a simple function 
on R?; (b) f is a positive measurable function with f(X,Y) integrable; and 
(c) the general case. 


Problem IV-35. On a probability space (Q,.A, P), consider an integrable 
random variable X and a sub-o-algebra B of A, both independent of an- 
other sub-c-algebra C of A. Prove that if D is the o-algebra generated by 
BUC, then 

E[X|D] = E[X|B]. 
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METHOD. Prove the assertion first for square integrable X. 


Problem IV-36. If X and Y are integrable random variables such that 
E[X|Y] = Y and E[Y|X] = X, show that X =Y as. 


METHOD. Show that, for fixed x, 


(2) 0< i, (X —~Y)dP = / (Y — X)dP, 
Y<xr<X z<X and «<y 


and conclude by symmetry that both sides of the equation are zero. Then 
use Problem I-13. 

Problem IV-37. Suppose that (Q,.A,P) is a probability space, X and Y 
are integrable random variables, and B is a sub-o-algebra of A such that 
X is B-measurable. 

(1) Show that E[Y |B] = X implies E[Y|X] = X. 

(2) Show by a counterexample that E[Y|X] = X does not imply that 
Ely |b] = xX. 

REMARK. If {A,}n>o is a filtration of (Q,A,P), {Xn}noo a sequence 
adapted to this filtration, and B, the o-algebra generated by Xo,..., Xn, 
then {X,,Bn}n>0 is a martingale if {X;,,An}n>o is. The converse is false. 
Problem IV-38. Let (Yo, Yi,...,¥n) be an (n + 1)-tuple of real random 
variables defined on a probability space (Q,€,P). Let F denote the sub- 
g-algebra of € generated by (Y;”),...,¥{“) = f(w) and assume that 
E({Yo|) < oo. 

(1) By applying Theorem IV-6.5.1 to f, show that there exists a Borel- 
measurable function g : R" — R such that 


E[Yo|F] = 9(%1, Yo....,¥,) P-almost everywhere. 


(2) Assume that the distribution of (Yo, ¥1,...Y,) in R"*? is absolutely 
continuous with respect to Lebesgue measure dyo, dy1,...,dyn, and let 
d(yo,Y1;---;Yn) denote its density. Prove that 


+oo 
E(YolF) = [K(M1,¥2,---. Ya) i. ee eee 


where K(yi,.--;Yn) = fre d(yo, Y1;--+,Yn)dyo. Prove that if A is a Borel 
subset of R, then 


P[Yo € A\F] = E[1ly,¢4|B] 
Kin Yall f dlao, Yin Yale. 


(3) Assume that the distribution of (Yo, Yi1,...,¥,) in R”t! is Gaussian 
(with the definition in IV-4.3.4, which implies that E(Y;) = 0 for j = 
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0,...,n). Use the observation that if (X,¥i,...,Yp) is Gaussian in R"*?, 
then X is independent of (Yi,...,Y,) if and only if E(XY;) = 0 Wj = 
1,...,n, to show that there exist real numbers ;,..., An such that E[Yo|F] 
=AN% +++ +AnYn. 


Problem IV-39. Let {X,,} be a sequence of independent real random 
variables with the same distribution and let F,, be the o-algebra generated 
by X1,...,Xn. Set Sp = X1+---+X,y for n > 0 and set So = 0. Which of 
the following processes are martingales relative to the filtration {F,}?29? 
(1) S,, if E(|X1|) < co. 

(2) X2+--.4+ X?2 — nd, if E(X?) < 00 and 2 is real. 

(3) exp(aS, — nd), if p(a) = E(exp(aX1)) < co and @ and X are real. 

(4) Yn = |Smin(n,r)|, Where T = inf{n > 0: S, = 0}, and we assume that 
PIG =A = PX 1] =e. 


Problem IV-40. Let ¥i,...,¥;,... be independent real random variables 
with the same distribution and such that E[|Yi|] < oo. Set S$, =Y1.+---+ 
Yn. 
(1) Show that E[Yi|S,] = Sp/nifl <k <n. 

(2) If m is fixed and X_ = Sm_—z/(m — k) for 0 < k < m—1, show that 
(Xo,---,;Xm-_1) is a martingale. (Apply Problem IV-35.) 


Problem IV-41. Let {X,}92, be a sequence of independent random 
variables with the same distribution defined by P[X, = k] = 27-* for 
k = 1,2,.... Random variables Z, are defined by letting Zp be a positive 
constant and setting Z, = (3Z,_1)/2*" for n =1,2,.... 

(1) Prove that {Z,,}°Z > is a martingale relative to the filtration {F,}?2o, 
where F,, is the o-algebra generated by Xj,..., Xn. 

(2) Use the law of large numbers (see Problem IV-6) to prove that Z, — 0 
almost surely as n — oo. 


REMARK. This gives a heuristic confirmation of the following unproved 
conjecture in number theory. If n is an odd positive integer, let f(n) = 
(3n + 1)2-"8"+)), where 2”(3"+1) denotes the greatest power of 2 that 
divides the integer 3n + 1. The conjecture asserts that, for every n, there 
exists an integer k such that the kth iterate of f satisfies f)(n) = 1. If 
n is very large, v(3n + 1) appears to behave like the variable X, of the 
problem, and {Z,}?°., like the sequence { fx(n)}?2,. 


Problem IV-42. Let H c L1(,A,P), where (Q,.A, P) is a probability 
space. 

(1) If F is a positive function on (0,+00) such that F(x)/x is increasing 
and — +00 as n > oo, and if 


sup E(F\h) = M < ov, 
heH 


show that H is uniformly integrable. 
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METHOD. Use Proposition IV-5.7.2. 
(2) If H is a bounded subset of £?(Q,A,P) with p > 1, show that H is 
uniformly integrable. 


Problem IV-43. Let {X;, Dt be independent random variables with 
values in N and with the same distribution. Assume that 0 <m = E(X11) < 
oo and that 0? = E((X1; — m)?) < oo. Consider the sequence of random 
variables defined by 


Zy+1 = 0 if Zn =0 
Bec: a So Renn - A SG 


F,, is the o-algebra generated by {X;;:1<i<oo, 1<j <n}. 
(1) Show that {Z,/m”".F,,}°, is a martingale. 
(2) Show that E (Z2,,/m2+)) = B(Z2/m?") + 0?/m?"}. 
Conclude that, ifm > 1, the martingale is regular. (Use Problem IV-42 
and Theorem IV-5.8.1.) 


REMARK. {Z,}%, is sometimes called the Galton-Watson process, and 
serves as a model in genetics. (X;,; is the number of offspring of the indi- 
vidual i of the jth generation, which has total size Z;.) 
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Problem V-1. Let E be the set of compactly supported C™ functions on 
R, and let d and 6 be the operators on F defined by 


(dy)(z) =y"(x) and (éy)(x) = —¢'(x) + xp(z). 
(1) Prove by induction on n that 
d"§ — 6d" =nd"—. 


(2) Let p be a norm on E. Let B be the algebra of operators on E which 
are continuous with respect to this norm, that is the set of endomorphisms 
a of & such that 


lla|| = sup{p(a(y)) : p(y) < 1} 
is finite. Assume that d and 6 are in B. Use (1) to prove that, for all n > 1, 


IInd" |] < 2|]a"~*]| [lal] [161]. 


(3) Deduce from (2) that d and 6 are never simultaneously continuous. 
REMARK. This result is due to Aurel Wintner (1947). 


Problem V-2. Let {H,,}°_, be the sequence of Hermite polynomials de- 
fined in V-1.3. 
(1) Use Proposition V-1.3.4 to show that, for n > 1, 


An+1 + nHyn-1 => tH. 
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If Hy, = fhe (compare with Problem IV-29), show that 


Bs 1 a. 
An +zi = 


H, Ay -1. 
rw+1 n+l 1 


it 
n+1 
(2) Conclude from (1) that the radius of convergence R(x) of )77~_9 =H, (a) 
is too for every complex number z. 


METHOD. For (2), show that for every € > 0 there exist an integer N(e) and 
a sequence {nr} Pn (e)-1 such that |H,| <a, and tn41 = €&n + €Lp-1. 


Problem V-3. Let {H;,,}%<o, d, and 6 be defined as in V-1.3. For nonneg- 
ative integers n, consider 


F, (x) = Hy(iv)(-2)". 


Let \ € C and define p by p = 6 + Ad. 
(1) For n > 1, prove that d"p = pd” + nd"~! and Fryi = 2F, + nFy-1. 
(2) Prove by induction on n that 


(d+p)” = >_> ChHg(p)a"*, 
k=0 


where C* denotes the binomial coefficient. 
(3) If y is a polynomial and ¢ is real, let 7(y)(x) = y(« + t). Prove that 
(exp(td))(y) = 74(y) and that 


2 
fexp t(d+ p)(¢) = (exp 5 expltoin) () 


In particular, if \ = 1 (that is, if p(w)(x) = ry(x)), compute (exp t(d + 
x))(~). 


REMARK. The result of (2) is due to Viskov*; that of (3) is due to Ville.® 


Problem V-4. Let X and Y be independent random variables with the 
same distribution (dr) = exp(—a?/2)dx/V/27. Let g : R — [0,+00) be 
a measurable function and let Z = X + Y./g(X). Assume that Z has a 
normal distribution. Cantelli conjectured in 1917 that g is then constant 
almost everywhere; this is still unproved in 1994. 

(1) Let go = E(g(X)). For all real t, compute E(exp tZ) as a function of 
go. Prove that exp(ag) € L?(v,) for all a > 0. 


METHOD. Use the Cauchy-Schwarz inequality. 
40. Viskov, Theory of Probability and Its Applications, Vol. 30, n. 1 (1984), 


141-143. 
5J. Ville, Comptes Rendus Acad. des Sc. 221 (1945), 529-539. 
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(2) Let {gn}%9 be the sequence of real numbers such that g(x) = 
9 gn 2 in the L?(v,) sense. By considering E(Z*) and E(Z*), show 
that g: = 0 and —2go = 7°... g2/nl. 

(3) Prove that g(x) < go + 1 almost everywhere. 


METHOD. If € > 0, let A, = {x : g(x) > € + 90 + 1}, and let a be a real 
number such that Al = A, [a,+0o) has positive measure. Consider 


t2 
i; expléz + > (g(x) — 1 — go)|v1 (dz). 
A 


, 
€ 


Problem V-5. As usual, we denote by {Hn}n>o0 the sequence of Hermite 
polynomials and by 1, the normal distribution on R. Let py be a probability 
distribution on R? such that if (X,Y) has distribution p, then X and Y 
have distribution 1; and there exists a real sequence {Cy }n>o0 with 


E(An(X)|¥) = CnHn(Y¥). 


(1) Prove that C, = E(H,(X)H,(Y)) and -1<C, <1 for all nin N. 
(2) Prove that if 3>,.,C? < +00, then p is absolutely continuous with 
respect to v;(dx)v; (dy) and its density is 


Fla; y) = a CH, (0)Hnl). 


n>0 


METHOD. For (2), write u(dz,dy) = 1, (dy) K(y, dx). Show that the func- 
tion c + f(x,y) is in L?(v,) y-almost everywhere and that, for every 
GEC, 


i exp(6x)(f(x,y)ri(de) -— K(y,dz)) =0 yae. 


Problem V-6. We keep the notation of Problem V-5 and denote by C 
the set of probability measures js on R? described there. Let py be a fixed 
element of C. 
(1) Define {on b}o<k<n by 
n 
= So bn, ee (2) 
k=0 
and let 


Prly) = S~ bn, CeHe(y)- 
k=0 


Show that fx" K(y,dz) = Pn(y) y-a.e. and that limy.. y7"Pr(y) = Cn. 
(2) Let o(y, dt) be the image of K(y, dx) under the mapping z+> 2/y. For 
6 € C, show that 


2 


+00 Q oo C 
/ exp(6t)o(y, dt) = exr( 35) S> sr Oty Hey) 
k=0 


—oo 
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and 


+00 OP 

lim / exp(6t)o(y, dt) = S° Chon. 

yr i 
eos k=0 

(3) Show that the probability measure o(dt) = limy.x o(y, dt) exists and 

that 


Cy = f to(an. 


From the fact that |C;,| < 1, conclude that a(R \ [—1, 1]) = 0. 

(4) Show that o is the unique probability measure on [—1,1] such that 
Gy = (eae ata), 

(5) Show that the mapping yz + o, from C to the set of probability measures 
on [—1, 1], is a bijection. What is y. when o is the Dirac measure at p? 


METHOD. For (5), consider successively the cases where p = 1, p = —l, 
and (using Problem V-5(2) and Mehler’s formula, V-1.5.8(ii)) p < 1. 


REMARK. This phenomenon was observed by O. Sarmanov (1966) and 
generalized by Tyan, Derin, and Thomas (1976). 
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